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1 Introduction 

This chapter provides an introduction to the following lectures. It is based 
on nine lectures given at the Saas Fee school in March 2008. Much, but not 
all of this material is contained in [38j and [53j . but with more more em- 
phasis on current research in millimeter/sub- millimeter astronomy. In the 
following, the basics of radiative transfer, receivers, antennas, interferometry 
radiation mechanisms and molecules are presented. The field of mm/sub-mm 
astronomy has become very large, so this presentation contains only a few 
derivations. The astute reader will note that Maxwell's Equations do not 
appear at all, although the results are based on these. In some cases, the 
approach is to quote a result followed by an example. This contribution is 
aimed at graduate students with a good background in physics and astron- 
omy. Some common terms are used but "jargon" has been avoided as much as 
possible. References are given, for the most part, to more recent work where 
citations to earlier work can be found. The units are mostly CGS with some 
SI units. This follows the usage in the astronomy literature. One topic not 
covered here is polarization; see [3S] and 05] for an introduction. 

The field of mm/sub-mm astronomy began only in the 1960's, but the 
richness of the results has justified this series of lectures. Millimeter/sub- 
mm measurements require excellent weather, very accurate antennas, and 
sensitive receivers. The interpretation of these data requires a knowledge of 
atomic and molecular physics, radiative transfer and interstellar chemistry. 
By number, about 90% of molecular clouds consist of H2. However, the de- 
termination of local densities and column densities of molecular hydrogen, 
H2 must be indirect since II2 does not emit spectral lines in cooler clouds. 
The Schrodinger equation governs all chemistry, but the conditions in the 
interstellar medium are very different (and much more varied) than those on 
earth. Earth-bound chemistry is only a subset of the more general interstel- 
lar chemistry; it is dominated by non-equilibrium processes that occur at low 
temperatures and densities, so determinations of collision and reaction rates 
are needed. A project devoted to this goal is the "Molecular Universe" (see 
the web site for details) . For an overview of interstellar chemistry, the reader 
is referred to the short presentation [21] or the monograph [47]. Although 
interstellar chemistry plays a very important role in molecular line astron- 
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omy, it is not included due to space limitations. Another glaring omission is 
a treatment of the Cosmic Microwave Background (CMB), since this is not 
treated in the following two lectures. Technically, the CMB emission is not 
weak, but does fill the entire sky, so special techniques must be employed. 

Mm/sub- mm astronomy is one of the most important tools to study the 
birth of stars [45], [37] and star formation in galaxies [44|. Stars form in 
molecular clouds where extinction is large. Thus, near infrared and optical 
studies are of limited value. At high red shifts, many active star forming 
galaxies and Active Galactic Nuclei (AGN's) are enshrouded by dust ^45] . 
Synchrotron emission at cm wavelengths from AGN's allows sub-arcsecond 
resolution images of relativistic electrons moving in B fields [42]. However, 
the imaging of dust and molecules on comparable scales is just beginning, so 
this is an area where important contributions can be made. 

The closest example of a supermassive Black Hole is Sgr A*, 8.5 kpc (1 
kpc=3.08 X 10^^ cm) from the Sun. Sgr A* is quiescent at present, but is 
thought to be similar to the "engines" that power AGN's. The radio emission 
from Sgr A* is interpreted as optically thick synchrotron emission, so with 
mm/sub- mm Very long Baseline interferometry, gas on Astronomical Unit 
(1 AU=1.5 X 10^'^ cm) scales, close to the Schwarzschild radius, can be 
sampled. 

In the field of star formation, we believe that all of the physical principles 
arc known. However, star formation is complex and measurements are needed 
to determine which processes are dominant and which can be neglected. In 
contrast, for studies of the early universe. Black Holes and AGN's not all of 
the relevant physical laws may be known. As in all of astronomy, measure- 
ments often lead to unexpected results that may have far reaching effects on 
our understanding of fundamental physical laws. 

The dominant continuum radiation mechanism in the mm/sub- mm wave- 
length range is dust emission. This differs from free- free and synchrotron 
emission, which is commonly encountered at centimeter wavelengths [13] . 
Spectral line radiation is dominated by thermal and quasi-thermal molecular 
radiation, although there are a few important atomic lines of carbon, oxygen 
and nitrogen. 

The sensitivity at mm/sub-mm wavelengths has become about 100 times 
better than was the case in the 1960's |20j. However sensitivity alone is not 
enough to transform the field. Rather, high resolution images are needed. 
This will change when the Atacama Large Millimeter Array (ALMA) begins 
operation. ALMA will initially operate between 3.5 mm and 0.4 mm. ALMA 
has a unique combination of high angular resolution and high sensitivity. 
This will allow ALMA to transform the field of mm/sub-mm astronomy. A 
summary of the science planned for ALMA is to be found in [S]. 
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2 Some Background 

In this section, we review the basics needed in foUowing sections (see pS]). 
Electromagnetic radiation in the radio window can be interpreted as a wave 
phenomenon, i. e. in term of classical physics. When the scale of the system 
involved is much larger than a wavelength, we can consider the radiation to 
travel in straight lines or rays. The power, dP, intercepted by a infinitesimal 
surface dcr is 

dP = cos 6* dl2 dcr dzy (1) 

where 

dP = power, in watts, 
da — area of surface, m^, 
dv = bandwidth, in Hz, 

= angle between the normal to da and the direction to di7, 
lu = brightness or specific intensity, in Wm^^ Hz^^ sr^^. 

Equation ([T]) is the definition of the brightness 1^. Quite often the term 
intensity or specific intensity is used instead of the term brightness. We 
will use all three designations interchangeably. 

The total flux of a source is obtained by integrating ([T]) over the total 
solid angle fig subtended by the source 

Su = j h{e,ip)cosedfl, (2) 

and this flux density is measured in units of Wm^^ Hz^^. Since the flux den- 
sity of astronomical sources is usually very small, a special unit, the Jansky 
(hereafter Jy) has been introduced 

IJy = fO^^'^Wm^^Hz"^ = f 0"^^ ergs-^cm-^Hz"^ . (3) 

As long as the surface element dcr covers the ray bundle completely, the 
power remains constant: 

dPi = dP2 . (4) 

From this, we can easily obtain 

Ivi = Iu2 (5) 

so that the brightness is independent of the distance. The total flux S,y density 
shows the expected dependence of I/r^. 

Another useful quantity related to the brightness is the radiation energy 
density Uu in units of erg cm~^. From dimensional analysis, Uu is intensity 
divided by speed. Since radiation propagates with the velocity of light c, we 
have for the spectral energy density per solid angle 
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u,{Q) = -I,. (6) 
c 

If integrated over the whole sphere, Air steradian, (Eq. [6]) will yeild the total 
spectral energy density 

j u^{n)dn=^J /^di7. (7) 

(47r) (47r) 



2.1 Radiative Transfer 

The equation of transfer is 



di, 
ds 



(8) 



The linear absorption coefhcient Ki, and the emissivity are independent 
of the intensity leading to the above form for dl^ . 

In Thermodynamic Equilibrium (TE) there is complete equilibrium of 
the radiation with its surroundings, the brightness distribution is described 
by the Planck function, which depends only on the temperature T, of the 
surroundings. The properties of the Planck function will be described in the 
next section. 

In Local Thermodynamic Equilibrium (LTE) Kirchhoff 's law holds 



(9) 



This is independent of the material, as is the case with complete thermody- 
namic equilibrium. In general however, J,^ will differ from B^{T). 
If we define the optical depth dr^ by 



dTi, — —Kyds 



or 



r^(s) = / K^{s)ds, 



(10) 
(11) 



then the equation of transfer ([8]) can be written as 



K,y ds dr.. 



(12) 



The solution of ^ is 
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/,(s) = /,(0)e-^-(^)+ J S,(r(r))e-Mr 



(13) 



If the medium is isothermal, 

T{t) = r(.s) = T = const. 

the integral is 



I, (s) = (0) e-^" + (T) (1 - c--'^ ) 



(14) 



For a large optical depth, that is for t,^(0) — > oo, in LTE approaches the 
limit 

I, = B,{T). (15) 

This is case for planets and the 2.7 K microwave background. The difference 
between /,/(s) and li,{0) gives 

AUs) = /,(s) - 7,(0) = {B,{T) - 7,(0)) (1 - e"^) . (16) 

Eq. [Tni represents an on-source minus an off-source measurement. 



2.2 Black Body Radiation 

and Brightness Temperature 

The spectral distribution of the radiation of a black body in thermodynamic 
equilibrium is given by the Planck law 



BAT) 



2hv^ 



1 



f,2 ^hy/kT _ ^ 



If hv <C fcT, one obtains Ray leigh- Jeans Law. 



(17) 



This is an expression of classical physics in which there is an energy kT per 
mode. The term is the density of states for 3 dimensions. In the millimeter 
and submillimeter range, one frequently defines a radiation temperature, J{T) 
as 



J{T) 



I = 



hv 



1 



(18) 



2fc;/2 ^ k e'^'^l^'^ - 1 ■ 
Inserting numerical values for k and h, we find that the Rayleigh-Jeans 
relation holds for frequencies 



(19) 
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It can thus be used for all thermal radio sources except perhaps for low 
temperatures in the mm/sub-mm range. 

In the Rayleigh- Jeans relation, the brightness and the thermodynamic 
temperature of the black body that emits this radiation are strictly propor- 
tional p?| . This feature is so useful that it has become the custom in radio 
astronomy to measure the brightness of an extended source by its bright- 
ness temperature Tb. This is the temperature which would result in the given 
brightness if inserted into the Rayleigh-Jeans law 



1 



2k j/2 

Combining ^ with ([20]) . we have 



2k 



-TuAf2 



For a Gaussian source, this relation is 



(20) 



(21) 



Jy 



0.0736 Tb 



e 


2 


A 


arc seconds 




mm 



(22) 



That is, if the flux density Si, and the source size are known, then the true 
brightness temperature, Tb, of the source can be determined. The concept of 
temperature in radio astronomy has given rise to confusion. If one measures 
5*1, and the apparent source size, Eq. [52] allows one to calculate the main 
beam brightness temperature^ Tmb- Details of this procedure will be given 
in Section [51 The performance of coherent receivers is characterized receiver 
noise temperature; see Section [3.2. II for details. The combination of receiver 
and atmosphere is characterized by system noise temperature; this is discussed 
in Section [01 and following. 

If Ii, is emitted by a black body and hi' ^ kT then (I^D]) gives the thermo- 
dynamic temperature of the source, a value that is independent of ly. If other 
processes are responsible for the emission of the radiation, Tb will depend on 
the frequency; it is, however, still a useful quantity and is commonly used in 
practical work. 

This is the case even if the frequency is so high that condition is not 
valid. Then (|5D)) can still be applied, but Tb is different from the thermody- 
namic temperature of a black body. However, it is rather simple to obtain 
the appropriate correction factors. 

It is also convenient to introduce the concept of brightness temperature 
into the radiative transfer equation pB)) . Formally one can obtain 



J(.T) - ^(MT) - /.(0))(1 - e--(^)) . 

Usually calibration procedures allow one to express J(T) as T. This measured 
quantity is referred to as T^, the radiation temperature, or the brightness 
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temperature, Tb. In the cm wavelength range, one can apply (j20p to p2p and 
one obtains 



dr.. 



7b(s)~T(s) 



(23) 



where T(s) is the thermodynamic temperature of the medium at the posi- 
tion s. The general solution is 



(24) 




If the medium is isothermal, this becomes 



Tb(s) =Tb(0)e-^-(^) +r(l - c-"'^W) 



(25) 



2.3 The Nyquist Theorem and the Noise Temperature 

We now relate voltage and temperature; this is essential for the analysis of 
receiver systems limited by noise. The average power per unit bandwidth 
produced by a resistor R is 



P. 



2R 



1 



(26) 



where v{t) is the voltage that is produced by i across R, and (• • •) indicates a 
time average. The first factor ^ arises from the condition for the transfer of 
maximum power from R over a broad range of frequencies. The second factor 
i arises from the time average of . An analysis of the random walk process 
shows that 

{v^)=ARkT. (27) 



Inserting this into ([26]) we obtain 



(28) 



Eq. (pS]) can also be obtained by a formulation of the Planck law for one 
dimension and the Rayleigh- Jeans limit. Then, the available noise power of 
a resistor is proportional to its temperature, the noise temperature Tn, and 
independent of the value of R. Throughout the whole radio range, from the 
longest waves to the far infrared region the noise spectrum is white, that is, 
its power is independent of frequency. Since the impedance of a noise source 
should be matched to that of the receiver, such a noise source can only be 
matched over some finite bandwidth. 
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2.3.1 Hertz Dipole and Larmor Formula 

In contrast to thermal noise, radiation from an antenna has a definite polar- 
ization and direction. One example (in electromagnetism there are no simple 
examples!) is a Hertz dipole. The total power radiated from a Hertz dipole 
carrying an oscillating current / at a wavelength A is 



2c 










V 2A J 



(29) 



For the Hertz dipole, the radiation is linearly polarized with the E field 
along the direction of the dipole. The radiation pattern has a donut shape, 
with a cylindrically symmetric maximum perpendicular to the axis of the 
dipole. Along the direction of the dipole, the radiation field is zero. One 
can use collections of dipoles, driven in phase, to restrict the direction of 
radiation. As a rule of thumb. Hertz dipole radiators have the best radiative 
efficiency when the wavelength of the radiation is roughly the size of the 
dipole. Following Planck, one can produce the the Black Body law from a 
collection of (quantized) Hertz dipole oscillators with random phases. 

There are similarities between Hertz dipole radiation and radiation from 
atoms. Following Larmor, the power radiated by a single electron oscillating 
with a velocity v{t) is 

PW = ^5^. (30) 

Expressing i) = x, we obtain an average power, emitted over one period of 
sinusoidal oscillation, x — sin i'Kvt. 

^P)=^-mn[ — ) ■ (31) 

Using — \lA 0116 arrives at the expression for spontaneous emission from 
a quantum mechanical system. This is presented again in Eq. 11321 It is often 
the case that quantum mechanical expressions and classical correspond to 
within factors of a few. 



3 Signal Processing 

and Stationary Stochastic Processes 

Next, we present the basics of signal processing and noise analysis needed to 
understand the properties of radiometers |38| . Fourier methods are of great 
value in analyzing receiver properties [7]. A review of submillimeter receiver 
systems is to be found in [39] . 

The concept of spectral power density was introduced in Eq.[26]in a practi- 
cal example. Radio receivers are devices that measure spectral power density. 
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Since the signals are dominated by noise, statistical analyses are needed. The 

most important of these statistical quantities is the probability density func- 
tion, p{x), which gives the probability that at any arbitrary moment of time 
the value of the process x{t) falls within an interval {x — ^ dx, x + ^ dx). For 
a stationary random process, p{x) will be independent of the time t. 

The expected value E{x} or mean value of the random variable x is given 
by the integral 



oo 

E{x} = J xp{x) dx 



(32) 



and, by analogy, the expectation value E{f{x)} of a function f{x) is given 
by 



OO 

E{f{x)}= I fix)pix)dx 



(33) 



A digression: the trend in all forms of communications (including radio 

astronomy!) is toward digital processing. In general, a digitized fimction must 
be sampled at regular intervals. Assume that the input signal extends from 
zero Hz to vq Hz (this is referred to the video band). Then the bandwidth, 
Ai>Q, is fQ and maximum frequency is vq. If wc picture the input as a collection 
of sine waves, it is clear that the sampling rate must be at least vq = 2Av 
to characterize the sinusoid with the highest frequency, sin27ri'ot- Thus the 
sample rate must be twice the bandwidth of the video input. This is referred 
to as the Nyquist Sampling Rate. This is a minimum; a higher sampling 
rate can only improve the characterization of the input. A higher sampling 
rate ("oversampling") will allow the input to be better characterized, thus 
giving a better Signal-to-Noise ratio (S/N) ratio. The sampling functions 
must occupy an extremely small time interval compared to the time between 
samples. If only a portion of the input function is retained in the quantization 
and sampling process, information is lost. This results in a lowering of the 
S/N ratio. 

At present, commercially available digitizers can sample a 1.5 GHz band- 
width with 8 bit quantization. This allows digital systems to reach within a 
few percent of the sensitivity of analog systems, but with much greater sta- 
bility. After digitization, one can maintain that all of the following processes 
are "just arithmetic". The quality of the data (i. e. the S/N) will depend on 
the analog receiver elements. The digital parts of a receiver can lower S/N 
ratios, but not raise them! 
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3.1 Square Law Detectors 

In radio receivers, noise is passed through a device that produces an output 
signal y{t) which is proportional to the power in a given input v{t): 

y{t) = av\t). (34) 

This involves an evaluation of the integral 

/-foo 
-oo 

There are standard approachs used to evaluate this expression. The result is 

E{y(i)}^E{v\t))^a<7l (35) 
For the evaluation of £'{y^(t)}, one must calculate 

-1 /•+00 

E{y[tf} = E{vHt)} = ^ / v^e-^^'^'^'dv 

The result of this integration is 

E{y\t)) = ia^at (36) 

and hence 

al = E{y^t)} - E^{y(t)} = 2E^{y{t)} . (37) 

3.2 Limiting Receiver Sensitivity 

A receiver must be sensitive, that is, be able to detect faint signals in the 
presence of noise. There are limits for this sensitivity, since the receiver input 
and the receiver itself are affected by noise. Even when no input source is 
connected to a receiver, there is an output signal, since any receiver generates 
thermal noise. This noise is amplified together with the signal. Since signal 
and noise have the same statistical properties, these cannot be distinguished. 
To analyze the performance of a receiver we will use the model of an ideal 
receiver producing no internal noise, but connected simultaneously to two 
noise sources, one for the external source noise and a second for the receiver 
noise. To be useful, receivers must increase the input power level. The power 
per unit bandwidth, P^, entering a receiver can be characterized by a tem- 
perature, as given by Eg. 1281 Pjy = kT. Furthermore, it is always the case 
that the noise contributions from source, atmosphere, ground and receiver, 
Ti, are additive, 

An often-used figure of merit is the Noise Factor, F . This is defined as 
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that is, any additional noise generated in the receiver contributes to N2. For 
direct detection systems, such as a Bolometers, G = 1. If Ti is set equal to 
To = 290K, we have 

Tr = - 1 X 290 

Given a value of F, one can determine the receiver noise temperature. If 
for = 115 GHz, F = 3db (=10° i ^^''), Tr=290K, a lousy receiver noise 
temperature. 



3.2.1 Receiver Calibration 



Our goal is to characterize receiver noise performance in degrees Kelvin. In the 
calibration process, a noise power scale (spectral power density) is established 
at the receiver input. In radio astronomy the noise power of coherent receivers 
(i. e. those which preserve the phase of the input) is usually measured in 
terms of the noise temperature. To calibrate a receiver, one relates the noise 
temperature increment AT at the receiver input to a given measured receiver 
output increment Az (this applies to coherent receivers which have a wide 
dynamic range and a total power or "DC" response) . In principle, the receiver 
noise temperature, Tr, could be computed from the output signal z provided 
the detector characteristics are known. In practice the receiver is calibrated 
by connecting two or more known sources to the input. Usually matched 
resistive loads at known (thermodynamic) temperatures Tl and Th are used. 
To within a constant, the receiver outputs are 

zl = (Tl+Tr)G, 
zu = (Th+Tr)G, 

from which 

_ Tu-ny 
2/- 1 

where 

y = zu/zi^ . (40) 

This is known as the "y-factor"; the procedure is a "hot-cold" measurement. 
Note that the y factor as presented here is determined in the Rayleigh-Jeans 
limit. The temperatures Th and Tl arc usiially produced by absorbers in 
the mm/sub- mm range. Usually these are chosen to be at the ambient tem- 
perature (Th = 293 K or 20° C) and at the temperature of liquid nitrogen 
(Tl = 78K or — 195°C). In rare cases, one might use liquid helium, which 
has a boiling point Tl = 4.2 K. In this process, the receiver is assumed to be 



(39) 
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a linear power measuring device (i. e. we assume that any non-linearity of the 
receiver is a small quantity) . Usually such a "hot-cold" calibration is done in- 
frequently. As will be discussed in Section r6.2l in the mm/sub-mm wavelength 
range, measurements of the emission from the atmosphere and then from an 
ambient resistive load are combined with models to provide an estimate of 
the atmospheric transmission. For a determination of the receiver noise, an 
additional measurement, usually with a cooled resistive load is needed. 

Bolometers (Section 14. 1|) do not preserve phase, so are incoherent re- 
ceivers. Their performance is strongly dependent on the bias of the detector 
element. The Bolometer performance is characterized by the Noise Equiva- 
lent Power, or NEP. The NEP is given in units of Watts Hz-^/^. NEP is the 
input power level which doubles the output power. Usually bolometers are 
"AC" coupled, that is, the output responds to differences in the input power, 
so hot-cold measurements are not useful for characterizing bolometers. 



3.2.2 Noise Uncertainties due to Random Processes 

It has been found that both source and receiver noise has a gaussian 
distribution [38j. We assume that the signal is a Gaussian random variable 
with mean zero 

p{x) = ^e-^''^'^\ (41) 
which is sampled at a rate equal to twice the bandwidth. 

Refer to Fig. [TJ By assumption E{vi) — 0. The input, vi has a much larger 
bandwidth, B, than the bandwidth of the receiver, that is, Ai/ <C -B.The 
output of the receiver is V2, with a bandwidth Av. The power corresponding 
to the voltage V2 is (wl)- 

P2^vl^<j^ ^kT^ysGAiy , (42) 

where Ap is the receiver bandwidth, G is the gain, and Tgys is the total 
noise from the input Ta and the receiver Tr. The contributions to Ta are 
the external inputs from the source, ground and atmosphere. Given that the 
output of the square law detector is V3 

(v,) = (vl) (43) 



Input 



Receiver 



Detector 



Fig. 1. The principal parts of a receiver. In the text, vi is at point 1, «2 at point 
2, etc. 
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then after square-law detection we have 



("3) = {vl) = = fcr,y,GAz. . 



(44) 



Crucial to a determination of the noise is the mean value and variance of (W3) 
From ((5^ the result is 



sampling rate, the averaged output, W4, is {\/N)Sv^ where N = 2Avt. 



We have explicitly separated Tgys into the sum Ta + 7r. Finally, we use the 
calibration procedure in Sect. l3.2.1l to eliminate the term kGAv. 



AT 


1 


T 

sys 


VAi^t 



(48) 



The calibration process allows us to specify the receiver output in degrees 
Kelvin instead of in Watts per Hz. We therefore characterize the receiver 
quality by the system noise temperature Tgys ~ 7a + Ir. 

For a given system, the improvement in the RMS noise cannot be better 
than as given in Eq. (|48p . Systematic errors will only increase AT, although 
the time behavior may follow the behavior described by (Eq. H5)) [TO] . We re- 
peat for emphasis: Tgys is the noise from the entire system. That is, it includes 
the noise from the receiver, atmosphere, ground, and and the source. There- 
fore AT is larger for an intense source. Except for some planets, however, 
this situation is rare in the mm/sub- mm range. 

3.2.3 Receiver Stability 

Sensitive receivers are designed to achieve a low value for Tsyg. Since the 
signals received are of exceedingly low power, receivers must also provide 
sufficient output power. This requires a large receiver gain, so even very small 
gain instabilities can dominate the thermal receiver noise. Therefore receiver 
stability considerations are also of prime importance. Great advances have 
been made in improving receiver stability. However in the mm/sub-mm range, 
the atmosphere plays an important role. To insure that the noise decreases 




From W4 and (7| 



crl/N, we obtain the result 



0-4 = kAvG{TA + TrJ/VA^T^ 



(47) 
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following Eg. 1481 systematic effects from atmospheric and receiver instabilities 
are minimized. Atmospheric changes are of crucial importance. These can be 
compensated for by rapidly taking the difference between the measurement 
of the source of interest and a reference region or a nearby calibration source. 
Such comparison switched measurements are necessary for all ground based 
observations. R. H. Dicke was the first to apply comparison switching was 
first applied to radio astronomical receivers [lOj . 

The time spent measuring references or performing calibrations will not 
contribute to an improvement in the S/N ratio. In fact, the subtraction of 
two noisy inputs worsens the difference, but are needed to reduce instabilities 
that give rise to systematic errors. The time r is the total time taken for the 
measurement (i. e. on-source and off-source). 

Even for the output of a total power receiver there will be additional 
noise in excess of that given by (pS)) since the signals to be differenced are 
AT + Tsys and Tgyg. This is needed since AT << Tgyg. For example, if one-half 
the total time is spent on the reference, the AT for difference of on-source 
minus off-source in (|48|) is a factor of 2 larger. 

4 Practical Receivers 

This section is concerned with the practical aspects of receivers that are 
currently in use [15] . [39], with some background material from |38j . 

4.1 Bolometer Radiometers 

The operation of bolometers makes use of the effect that the resistance, R, 
of a material varies with the temperature. When radiation is absorbed by 



Fig. 2. A bolometer is represented by the smaller square to the right. The power 
from an astronomical source, Pq, raises the temperature of the bolometer element 
by AT, which is much smaller than the temperature To of the heat sink. Heat 
capacity, C, is analogous to capacitance. The conductance, Q is analogous to elec- 
trical conductance, G, which is 1/R. The noise performance of bolometers depends 
critically on the thermodynamic temperature. To, and on the conductance Q. The 
temperature change causes a change in the voltage drop across the bolometer (the 
details of the electric circuit are not shown) 



G=R 



1-1 
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the bolometer material, the temperature varies; this temperature change is a 
measure of the intensity of the incident radiation. Because this thermal effect 
is rather independent of the frequency of the radiation absorbed, bolome- 
ters are intrinsically broadband devices. Frequency discrimination must be 
provided by external filters. 

The Noise Equivalent Power (NEP) quoted for a bolometer is the input 
power that doubles the output of this device. The expression for NEP is 



Where e is the emissivity of the background, and Tbg is temperature of the 
background. Typical values for ground based bolometers are e = 0.5, Tbg = 
300 K and Av = 100 GHz. For these values NEPp^, = 1.3 x lO'^^ 
Watts Hz~^/^. With the collecting area of the 30m IRAM telescope and a 
100 GHz bandwidth one can easily detect mJy sources. 

4.2 Currently Used Bolometer Systems 

Bolometers mounted on ground based radio telescopes are background noise 
limited, so the only way to substantially increase mapping speed for extended 
sources is to construct large arrays consisting of many pixels. In present 
systems, the pixels are separated by 2 beamwidths, because of the size of 
individual bolometer feeds. The systems which best cancel atmospheric fluc- 
tuations are composed of rings of close-packed detectors surrounding a single 
detector placed in the center of the array. Two large bolometer arrays have 
produced many significant published results. The first is MAMB02 (MAx- 
Planck-Millimeter Bolometer). This is a 117 element array used at the IRAM 
30-m telescope. This system operates at 1.3 mm, and provides an angular 
resolution of 11". The portion of the sky that is measured at one instant is 
the field of view, (FOV). The FOV of MAMB02 is 240". The second system 
is SCUBA (Submillimeter Common User Bolometer Array) This is used 
on the James-Clerk-Maxwell (JCMT) 15-ni sub-mm telescope at Mauna Kea, 
Hawaii. SCUBA consists of a 37 element array operating at 0.87 mm, with an 
angular resolution of 14" and a 91 element array operating at 0.45 mm with 
an angular resolution of 7.5"; both have a FOV of about 2.3'. The LABOCA 
(LArge Bolometer CAmera) array operates on the APEX 12 meter telescope. 
APEX is on the 5.1 km high Chaijnantor plateau, the ALMA site in north- 
ern Chile. The LABOCA camera operates at 0.87 mm waqvelength, with 295 
bolometer elements. These are arranged in 9 concentric hexagons around a 
center element. The angular resolution of each element is 18.6", the FOV 
is 11.4'. Such an arrangement is ideal for the measurement of small sources 
since the outer rings of detectors can be used to subtract the emission from 
the sky, while the central elements are measuring the source. 
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4.2.1 Superconducting Bolometers 

A promising new development in bolometer receivers is Transition Edge Sen- 
sors referred to as TES bolometers. These superconducting devices may allow 
more than an order of magnitude increase in sensitivity, if the bolometer is 
not background limited. For broadband bolometers used on earth-bound tele- 
scopes, the warm background limits the performance. With no background, 
the noise improvement with TES systems is limited by photon noise; in a 
background noise limited situation, TES's should be ~2-3 times more sen- 
sitive than semiconductor bolometers. For ground based telescopes, TES's 
greatest advantage is multiplexing many detectors with a superconducting 
readout device, so one can construct even larger arrays of bolometers. SCUBA 
will be replaced with SCUBA-2 now being constructed at the U. K. Astron- 
omy Technology Center. SCUBA-2 is an array of 2 TES bolometers, each 
consisting of 6400 elements operating at 0.87 mm and 0.45 mm. The FOV of 
SCUBA-2 will be 8'. The SCUBA-2 design is based on photo-deposition tech- 
nology similar to that used for integrated circuits. This type of construction 
allows for a closer packing of the individual bolometer pixels. In SCUBA-2 
these will be separated by 1/2 of a beam, instead of the usual 2 beam spacing. 

4.2.2 Polarization & Spectral Line Measurements 

In addition to measuring the continuum total power, one can mount a 
polarization-sensitive device in front of the bolometer and thereby measure 
the direction and degree of linear polarization. These devices have been used 
with SCUBA on the James-Clerk-Maxwell Telescope in Hawaii and (in the 
past) with a 19 beam bolometer at the Heinrich-Hertz- Telescope in Arizona. 

It is possible to also carry out spectroscopy, if frequency sensitive ele- 
ments, either Michelson or Fabry-Perot interferometers, are placed before the 
bolometer element. Since these spectrometers operate at the sky frequency, 
the fractional resolution {/!s.v/v) is limited. 

4.3 Coherent Receivers 

Coherent receivers are those which preserve the phase of the signal. Usu- 
ally, coherent receivers make use of the superheterodyne (or more commonly 
"heterodyne") principle to shift the signal input frequency without changing 
other properties; in practice, this is carried out by the use of mixers (Sec- 
tion 3331) ■ Heterodyne is commonly used in all branches of communications 
technology; its use allows measurements with unlimited spectral resolution, 
Av/v. 

4.3.1 The Minimum Noise in a Coherent System 

The ultimate limit for coherent receivers or amplifiers is obtained by ap- 
plication of the Heisenberg uncertainty principle. The minimum noise of a 
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coherent amplifier results in a receiver noise temperature of 



. hv 

irxlnimimum) = — 
k 



(50) 



For incoherent detectors, such as bolometers, phase is not preserved, so this 
limit does not exist. In the millimeter wavelength regions, this noise tem- 
perature limit is quite small. At A=2.6 mm (i/=115 GHz), this limit is 5.5 
K. 



4.3.2 Elements of Coherent Receivers 

The noise in the first element dominates the system noise. The exact ex- 
pression is given by the Friis relation which takes into account the effect of 
having cascaded amplifiers : 
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Analog Receiver Block Diagram 




Fig. 3. On the left are the parts of a coherent receiver. For 1^0 <120 GHz one 
can amplify before mixing while for higher frequencies the first circuit element is 
the mixer. In the middle are sketches of the time behavior of broadband noise 
after passing through these parts of the receiver; on the right are sketches of the 
corresponding frequency behavior. The mixer shifts the signal to a lower freuqency 
while the amplifier increases the output over a narrow band. The square law detector 
converts rapidly oscillating signals into a smooth response that has positive values. 
The middle and right sketches are taken from [36) . 
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Where Gi is the gain of the first element, and Tgi is the noise temperature 
of this element. The corresponding values apply to the following elements in 
a receiver. For A <3 mm, cooled first elements typically have Gi = 10^ and 
Tsi = 50K; for A <0.3 mm, cooled first elements typically have Gi = 1 and 
Tsi = 500K. 

4.3.3 Mixers 

Mixers allow the signal frequency to be changed without altering the charac- 
teristics of the signal. In the mixing process, one multiplies the input signal 
with an intense monochromatic signal from a local oscillator, LO, in a non- 
linear circuit element. In principle a mixer produces a shift in frequency of 
an input signal with no other effect on the signal properties. For a single 
mixer, two frequency bands, at equal separations from the LO frequency 
are shifted into intermediate (IF) frequency band. This is Double Sideband 
(DSB) mixer operation. These are referred to as the signal and image bands. 
In the mm/sub-mm wavelength ranges, such mixers are still commonly used 
as the first stage of a receiver. For single dish continuum measurements, both 
sidebands contain the signal, so DSB operation does not decrease the signal- 
to-noise (S/N) ratio. However, for single dish spectral line measurements, the 
spectral line of interest is in one sideband only. The other sideband is then 
a source of extra noise (i. e. lower S/N ratio) and perhaps confusing lines. 
Therefore single sideband (SSB) operation is desired. If the image sideband 
is eliminated, the mixer is said to operate in SSB mode. This can be accom- 
plished by inserting a filter before the mixer. However, filters cause a loss 




Fig. 4. A sketch of the frequencies shifted from the sky frequency (top) to the 
output (lower) of a double sideband mixer. In this example, the input is at the 
sky frequencies for the Upper Side Band (USB) of 115 GHz, and Lower Side Band 
(LSB) of 107 GHz while the output frequency is 4 GHz. The slanted boxes represent 
the passbands; the direction of the slant in the boxes indicate the upper (higher) 
and lower (lower) edge of the bandpass in frequency [53'. ALMA mm mixers are 
SSB mixers so these effects are avoided. 
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of signal, so lower the S/N ratio. For low noise applications a more complex 
arrangement is needed. In many cases, a single sideband mixer is used. This 
consists of two identical mixers driven by a single local oscillator through a 
phase shifting device. 

For example, the ALMA Band 7 mm/sub-mm receiver shifts a signal 
centered at 345 GHz from the sky frequency, 341 to 349 GHz to 4-12 GHz. 
At 4 to 12 GHz the signal is easily amplified. This is an SSB mixer; the 
unwanted sideband is not accepted. 

Noise in mixers has 3 causes. The first is the mixer itself. Since one half 
of the input signal at i^sky is shifted to an unwanted frequency t'LO + t'skyj 
the signal suffers a factor-of-two (3 db) loss. This is referred to as conver- 
sion /oss. Classical mixers typically have 3 db (=factor of 2) loss. In Eq. [5T| 
Gi = 1/2. In addition there will be an additional noise contribution from the 
mixer itself ( Tgi in ([?T|) 1. Second, the LO may have "phase noise", that is 
a rapid change of phase, which will affect signal properties, so adds to the 
uncertainties. Third, the amplitude of the LO may vary. This effect can be 
minimized since the mixer LO power is adjusted so that the mixer output is 
saturated. Then there is no variation of the output signal power if LO power 
varies. 

4.4 HEMTs/MMICs 

Within a highly ordered crystal made of identical atoms, free electrons can 
move only within certain energy bands. By varying the material, both the 
width of the band, the band gap, and the energy to reach a conduction band 
can be varied. The crucial part of any semiconductor device is the junction. 
On the one side there is an excess of material with negative carriers, forming 
n-type material and the other side material with a deficit of electrons, that 
is p-type material. The p-type material has an excess of positive carriers, or 
holes. At the junction of a p- and n-type material, the electrons in the n-type 
material can diffuse into the p-type material (and vice- versa), so there will be 
a net potential difference. The diffusion of charges, p to n and n to p, cannot 
continue indefinitely, but a difference in the charges near the boundary of the 
n and p material will remain, because of the low conductivity of the semi- 
conductor material. From the potential difference at the junction, a flow of 
electrons in the positive direction is easy, but a flow in the negative direction 
will be hindered. Typical p-n junctions have a slow response so are suitable 
only as square-law detectors. Schottky (metal-semiconductor) junctions have 
a lower capacitance, so are better suited to applications such as microwave 
mixers. The combination of three layers, p-n-p, in a so-called "sandwich", is 
a simple extension of the p-n junction. In Field Effect Transistors, FET's, 
the electric fleld controls the carrier flow so this is an amplifier. At frequen- 
cies of 100 GHz, unipolar devices, which have only one type of carrier, are 
used as microwave amplifier front ends. High Electron Mobility Transistors, 
HEMTs, are an evolution of FETs. The design goals of HEMT's are: (1) to 
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obtain lower intrinsic amplifier noise and (2) operation at higher frequency. 
In HEMTs, the charge carriers arc present in a channel of smaU size. This 
confinement gives rise to a two dimensional electron gas, or "2 DEG", where 
there is less scattering and hence lower noise. When cooled, there is a sig- 
nificant improvement in the noise performance, since the main contribution 
is from the oscillations of nuclei in the lattice, which are strongly tempera- 
ture dependent. To extend the operation of HEMT to higher frequencies, one 
must increase electron mobility, /i, and saturation velocity Vs. A reduction 
in the scattering by introducing impurities ("doping") leads to a larger elec- 
tron mobility, /it, and hence faster transit times, in addition to lower amplifier 
noise. 

For use up to i/=115 GHz with good noise performance, one has turned to 
modifications of HEMTs based on advances in material-growth technology. 
The SEQUOIA receiver array of the Five College Radio Astronomy Obser- 
vatory uses Microwave Monolithic Integrated Circuits (MMIC's) in 32 front 
ends for a 16 beam, two polarization system (pioneered by S. Weinreb). The 
MMIC is a complete amplifier on a single semiconductor, instead of using 
lumped components. The MMIC's have excellent performance in the SO- 
US GHz region without requiring tuning adjustments. The simplicity makes 
MMIC's better suited for multi-beam systems. 

For low noise IF amplifiers, 4 to 8 GHz IF systems using Gallium- Arsinide 
HEMTs with 5 K noise temperature and more than 20 db of gain have been 
built. With Indium-Phosphide HEMTs on GaAs-substrates, even lower noise 
temperatures are possible. As a rule of thumb, one expects an increase of 0.7 
K per GHz for GaAs, while the corresponding value for InP HEMTs is 0.25 
K per GHz. For front ends, noise temperatures of the amplifiers in the 18-26 
GHz range are typically 12 K. 

4.4.1 Superconducting Mixers 

Very general, semi-classical considerations indicate that the slope of the 
current-voltage, I-V, curve for classical mixers changes gently. This leads 
to a relatively poor noise figure for classical mixers, since much of the input 
signal is not converted to a lower frequency. 

A significant improvement can be obtained if the junction is operated in 
the superconducting mode. Then the gap between filled and empty states is 
comparable to photon energies in the mm/sub-mm range. In addition, the 
LO power rcriuirements are » 1000 times lower than are needed for conven- 
tional mixers. Finally, the physical layout of such devices is simpler since the 
mixer is a planar device, deposited on a substrate by lithographic techniques. 
SIS mixers consist of a superconducting layer, a thin insulating layer and an- 
other superconducting layer. SIS mixers depend on single carriers; a longer 
but more accurate description of SIS mixers is "single quasiparticle photon 
assisted tunneling detectors". When the SIS junction is properly biased, the 
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filled states reach the level of the unfilled band, and the electrons can quan- 
tum mechanically tunnel through the insulating strip. The I-V curve for a SIS 
device shows sudden jumps in the I-V curve; these are typical of quantum- 
mechanical phenomena. For low noise operation, the SIS mixer must be DC 
biased at an appropriate voltage and current. If, in addition to the mixer 
bias, there is a source of photons of energy hv, then the tunneling can occur. 
If one then biases an SIS device and applies an LO signal at a frequency v, 
the I-V curve becomes very sharp, so the conversion of sky signals to the IF 
frequency is much more effective than with a classical mixer. 

Under certain circumstances, SIS devices can produce gain. If the SIS 
mixer is tuned to produce substantial gain, it is unstable. Thus, this not 
useful in radio astronomical applications. In the mixer mode, that is, as a 
frequency converter, SIS devices can have a small amount of gain. This tends 
to balance inevitable mixer losses, so SIS devices have losses that are lower 
than classical mixers. SIS mixers have performance that is unmatched in the 
mm/sub-mm region. In addition to single s«de6anrf properties, improvements 
to existing designs include tunerless and SIS mixers. Tuner less mixers have 
the advantage of repeatability in tuning. For ALMA, SIS mm mixer designs 
are wideband, tunerless, single sideband devices with extremely low mixer 
noise temperatures. 

An increase in the gap energy is needed to allow the efficient mixing at 
higher frequenccies. This is done with Niobium superconducting materials; 
the geometric junction sizes are 1 |J,m by 1 )im. For frequencies above 900 GHz, 
one uses niobium nitride junctions. Variants of such devices, such as the use 
of junctions in series, can be used to reduce the capacitance. An alternative 
is to reduce the size of the individual junctions to 0.25 )im. 
SIS mixers are the front ends of choice for operation between 150 GHz and 
900 GHz because these are low-noise devices, the IF bandwidths can be >1 
GHz, these arc timable over ^30% of the frequency range and the local 
oscillator power needed is <1 |J,W. 

4.4.2 Hot Electron Bolometers 

Superconducting Hot Electron Bolometer-mixers (HEB) are heterodyne de- 
vices, in spite of the name. These mixers make use of superconducting thin 
films which have sub-micron sizes. In an HEB mixer excess noise is removed 
either by diffusion of hot electrons out the junction, or by an electron-phonon 
exchange. The first HEBs operating on radio telescopes and used to take as- 
tronomical made use of electron-phonon exchange. The HEB junctions were 
of |J,m size, consisting of Niobium Nitride (NbN), cooled to 4.2 K. Junctions 
of Aluminum-Titanium-Nitride, AlTiN, have provided lower receiver noise 
temperatures. The IF center frequency was 1.8 GHz, and a had a full width 
of 1 GHz. Such a system was used to measure the J = 9 — 8 carbon monoxide 
line at 1.037 THz. Later, the group of First Physical Institute at Cologne 
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University used the Atacama Pathfinder Experiment (APEX) telescope to 
measure the [Nil] hne at 1.5 THz (see Table 1). 

4.4.3 Single Pixel Receiver Systems 

In summary, devices that provide the lowest noise front ends arc: 

for v < 115 GHz, High Electron Mobility Transistors (HEMT) and Microwave 

Monolithic Integrated Circuits (MMIC) 

for 72 < J/ < 800 GHz, Superconducting Mixers (SIS) 

for ly > 900 GHz, Hot Electron Bolometers (HEB) 

SIS mixers provide the lowest receiver noise in the mm and sub-mm range. SIS 
mixers are much more sensitive than classical Schottky mixers, and require 
less local oscillator power, but must be cooled to 4 Kelvin. All millimeter 
mixer receivers are tunable over 10 to 20 % of the sky frequency. From the 
band gaps of junction materials, there is a short wavelength limit to the 
operation of SIS devices. For spectral line measurements at wavelengths, at 
A < 0.3mm, superconducting Hot Electron Bolometers (HEB), which have 
no such limit, have been developed. At frequencies above 2 THz there is a 
transition to far-infrared and optical techniques. The highest frequency het- 
erodyne systems in radio astronomy are used in the Herschel-HIFI satellite. 
These are SIS and HEB mixers. 

The SIS or HEB mixers convert the sky frequency to the fixed IF fre- 
quency, where the signal is amplified by the IF amplifiers. Most of the ampli- 
fication is done in the IF. The IF should only contribute a negligible part to 
the system noise temperature. Because some losses are associated with fre- 
quency conversion, the first mixer is a major source for the system noise. Two 
ways exist to decrease this contribution: (1) by use of either an SIS or HEB 
mixer to convert the input to a lower frequency, or (2) at lower frequencies 
by use of a low-noise amplifier before the mixer. 

4.4.4 Multibeam Systems 

At 3 mm, the SEQUOIA array receiver produced at the Five College Radio 

Astronomy Observatory (FCRAO) with 32 MMIC front ends connected to 
16 beams had been used on 14-m telescope of the FCRAO for the last few 
years. Multibeam system that use SIS front ends are rare. A 9 beam Hetero- 
dyne Receiver Array of SIS mixers at 1.3 mm, HERA, has been installed on 
the IRAM 30-m millimeter telescope to measure spectral line emission. To 
simplify data taking and reduction, the HERA beams are kept on a Right 
Ascension-Declination coordinate frame. HARP-B is a 16 beam SIS system 
in operation at the James-Clerk-Maxwell telescope. The sky frequency is 325 
to 375 GHz. The beam size of each element is 14", with a beam separation 
of 30", and a FOV of about 2'. The total number of spectral channels in a 
heterodyne multi-beam system will be large. In addition, complex optics is 
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needed to properly illuminate all of the beams. In the mm range this usually 
means that the receiver noise temperature of each element is somewhat larger 
than that for a single pixel receiver system. For further details of SEQUOIA, 
HERA or HARP, see the appropriate web sites. 

For single dish continuum measurements at A < 2 mm, multi-beam sys- 
tems make use of bolometers. GeGa bolometers are the most common sys- 
tems and the best such systems have a large number of beams. In the future, 
TES bolometers seem to have great promise. Compared to incoherent reciev- 
ers, heterodyne systems are still the most efficient for spectral lines in the 
range A >0.3mm, although Fabry-Perot systems (such as SPIFI; see the web 
site) may be competitive for some projects. For bolometers on the Herschel 
satellite, one uses gratings or Fabry-Perot systems. For SCUBA-2, an analog 
Michelson ( Fourier transform interferometer) is proposed. 

4.5 Back Ends: Spectrometers 

The term "Back End" is used to specify the devices following the IF ampli- 
fiers. Of the many different back ends that have been designed for specialized 
purposes, spectrometers are probably the most widely used. Previously this 
was carried out in especially designed hardware, but recently there have been 
devices based on general purpose digital computers. 

Spectrometers analyze the spectral information contained in the radiation 
field. To accomplish this these must be SSB and the frequency resolution Ai' 
is usually fine; perhaps in the kHz range. In addition, the time stability must 
be high. If a resolution of Ai/ is to be achieved for the spectrometer, all those 
parts of the system that enter critically into the frequency response have to 
be maintained to better than 0.1 Z\i^. An overview of the current state of 
spectrometers is to be found in j6j. 

4.5.1 Multichannel Filter Spectrometers 

The time needed to measure the power spectrum for a given celestial posi- 
tion can be reduced by a factor n if the IF section with the filters defining 
the bandwidth Av, the square-law detectors and the integrators are built 
not merely once, but n times. Then these form n separate channels that 
simultaneously measure different (usually adjacent) parts of the spectrum. 

4.6 Fourier, Autocorrelation and Cross Correlation Spectrometers 

One method is to Fourier Transform (FT) the input, v(t), to obtain u(i/) and 
then square v{iy) to obtain the Power Spectral Density. From the Nyquist the- 
orem, it is necessary to sample at a rate equal to twice the bandwidth. These 
are referred to as "FX" autocorrelators. Recent developments at the Jodrell 
Bank Observatory have led to the building of COBRA (Coherent Baseband 



24 T. L. Wilson 



Receiver for Astronomy). COBRA can analyze a 100 MHz bandwidth. A sim- 
ilar device with a 1 GHz bandwidth has been built at the Max-Planck- Institut 
in Bonn for use in the mm/sub-mm range on APEX. 

For Autocorrelators, or XF systems, the input v{t) is correlated with a 
delayed signal v{t — t) to obtain the autocorrelation function R{t) function. 
R{t) is then Fourier Transformed to obtain the spectrum. For an XF system 
the time delays are performed in a set of serial digital shift registers with 
a sample delayed by a time r. Autocorrelation can also be carried out with 
the help of analog devices using a series of cable delay lines. Such analog 
correlators have been developed at the University of Maryland together with 
NRAO for use on the Green Bank Telescope (GBT); these are used to provide 
very large bandwidths. 

The two significant advantages of digital spectrometers are: (1) flexibility 
and (2) a noise behavior that follows after many hours of integration. 

The flexibility allows one to select many different frequency resolutions and 
bandwidths or even to employ a number of different spectrometers, each with 
different bandwidths. simultaneously. The second advantage follows directly 
from their digital nature. Once the signal is digitized, it is only mathemat- 
ics. Tests on astronomical sources have shown that the noise follows a 1/ \/Bi 
behavior for integration times >100 hours; in these aspects, analog spectrom- 
eters are more limited. 

A serious drawback of digital auto and cross correlation spectrometers had 
been limited bandwidths. Previously 50 to 100 MHz had been the maximum 
possible bandwidth. This was determined by the requirement to meet Nyquist 
sampling rate, so that the analog-to-digital (A/D) converters, samplers, shift 
registers and multipliers would have to run at a rate equal to twice the band- 
width. The speed of the electronic circuits was limited. However, advances 
in digital technology in recent years have allowed the construction of auto- 
correlation spectrometers with several 1000 channels covering bandwidths of 
several GHz. 

Another improvement is the use of recycling auto and cross correlators. 
These spectrometers have the property that the product of bandwidth, B 
times the number of channels, N, is a constant. Basically, this type of system 
functions by having the digital part running at a high clock rate, while the 
data are sampled at a much slower rate. Then after the sample reaches the 
A^th shift register it is reinserted into the first register and another set of 
delays are correlated with the current sample. This leads to a higher number 
of channels and thus higher resolution. Such a system has the advantage of 
high-frequency resolution, but is limited in bandwidth. This has the greatest 
advantage for longer wavelength observations. Both of these developments 
have tended to make the use of digital spectrometers more widespread. This 
trend is likely to continue. 

Autocorrelation systems are used in single telescopes, and make use of the 
symmetric nature of the autocorrelation function ACF. Thus, the number of 



Millimeter/Sub-Millimeter Astronomy 25 

delays gives the number of spectral channels. For cross-correlation, the cur- 
rent and delayed samples refer to different inputs. Cross-correlation systems 
are used in interferometers. In the simplest case of a two-element interferom- 
eter, the output is not symmetric about zero time delay, but can be expressed 
in terms of amplitude and phase at each frequency, where both the phase and 
intensity of the line signal are unknown. Thus, for interferometry the zero de- 
lay of the ACF is placed in channel N/2 and is in general asymmetric. The 
number of delays. A'', allows the determination of N/2 spectral intensities, 
and N/2 phases. The cross-correlation hardware can employ either an XF or 
a FX correlator. The FX correlator has the advantage that the time delay is 
just a phase shift, so can be introduced more simply. 

4.6.1 Acousto-Optical Spectrometers 

Since the discovery of molecular line radiation in the mm wavelength range 
there has been a need for spectrometers with bandwidths of several GHz. 
At 100 GHz, a velocity range of 300 km s^^ corresponds to 100 MHz, while 
the narrowest line widths observed correspond to 30 kHz. Autocorrelation 
spectrometers can reach such large bandwidths only if complex methods are 
used. The AOS makes use of the diffraction of light by ultrasonic waves: these 
cause periodic density variations in the medium through which it passes. 
These density variations in turn cause variations in the bulk constants e and 
n of the medium, so that a plane electromagnetic wave passing through this 
medium will be affected. The most advanced AOS's have been designed and 
built at the First Physical Institute at Cologne University. 



5 Filled Aperture Antennas 

The material presented in the following emphasizes descriptive antenna pa- 
rameters. These allow a fairly accurate but rather simple description of an- 
tenna properties that are needed for an accurate interpretation of astronom- 
ical measurements. For more detail, see [3,. An important concept is reci- 
procity, in which the properties of antennas are the same, irrespective whether 
these are used for receiving or transmitting. Reciprocity is limited under some 
(very special) circumstances that are not encountered in astronomy. 



5.1 Angular Resolution 



From diffraction theory [25j , the angular resolution of a reflector of diameter 
£) at a wavelength A is 



A 
D 



(52) 



This simple result gives an approximate value for but more detailed results 
must take into account details of the illumination. 
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5.2 The Power Pattern P(i?, ip) 

Often, the normalized power pattern, not the power pattern is measured: 



1 



Pr 



(53) 



The reciprocity theorem provides a method to measure this quantity. The 
radiation source can be replaced by a small diameter radio source. The flux 
densities of such sources are determined by measurements using horn an- 
tennas at centimeter and millimeter wavelengths. At short wavelengths, one 
uses planets, or moons of planets, whose surface temperatures are determined 
from infrared data. 

If the power pattern is measured using artificial transmitters, care should 
be taken that the distance from a large antenna A (diameter D) to a small 
antenna B (transmitter) is so large that B produces plane waves across the 
aperture D of antenna A, i. e. is in the far radiation field of A. This is 
the Rayleigh distance; it requires that the curvature of a wavefront emitted 
by B is much less than a wavelength across the geometric dimensions of A. 
This curvature must be /c <C 2D^/X, for an antenna of diameter D and a 
wavelength A. 

Consider the power pattern of the antenna used as a transmitter. If the 
spectral power density, Vi^ in [W Hz^^] is fed into a lossless isotropic antenna, 
this would transmit P power units per solid angle per Hertz. Then the total 
radiated power at frequency v is 47r P^, In a realistic, but still lossless antenna, 
a power P(i?, cp) per unit solid angle is radiated in the direction {{}, ip). If we 
define the directive gain (^(i?, ip) as the 

P{§,^) = G{d,^)P 



or 
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Thus the gain or directivity is also a normalized power pattern similar to (|53p . 
but with the difference that the normalizing factor is / P('i?, ^p) dQ/An. This 
is the gain relative to a lossless isotropic source. Since such an isotropic source 
cannot be realized in practice, a measurable quantity is the gain relative to 
some standard antenna such as a half-wave dipole whose directivity is known 
from theoretical considerations. 



5.3 The Main Beam Solid Angle 



The beam solid angle ^2a of an antenna is given by 
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II 







F„(i?,(p) sini?d^9 dip 



(55) 



this is measured in steradians (sr). The integration is extended over the full 
sphere Att, such that ]7a is the solid angle of an ideal antenna having Pn = 1 
for all of Qa and Pn = everywhere else. Such an antenna does not exist; 
for most antennas the (normalized) power pattern has considerably larger 
values for a certain range of both "i? and ip than for the remainder of the 
sphere. This range is called the main beam or main lobe of the antenna; 
the remainder are the side lobes or back lobes (Fig.[5|). For actual situations, 
the properties are well defined up to the the shortest operating wavelengths. 
At the shortest wavelength, there is still a main beam, but much of the 
power enters through sidelobes. In addition, the main beam efficiency may 
vary significantly with elevation and weather has a large effect. Thus, the 
ability to accurately calibrate the radio telescope at sub-mm wavelengths is 
challenging. 

In analogy to (|55p we define the main beam solid angle Omb by 



The quality of an antenna as a direction measuring device depends on how 
well the power pattern is concentrated in the main beam. If a large fraction 
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Fig. 5. A polar power pattern showing the main beam, and near and far side 
lobes. The weaker far side lobes have been combined to form the stray pattern 
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of the received power comes from the side lobes it would be rather difficult 
to determine the location of the radiation source, the so-called "pointing". 

It is appropriate to define a main beam ejjiciency or (in the slang of 
antenna specialists) beam efficiency, tjb, by 



i7. 



MB 



Ha 
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The main beam efficiency, tjb is the fraction of the power is concentrated 
in the main beam. The main beam efficiency can be modified (within certain 
limits) for parabolic antennas by the illumination of the main reflector. If 
the FWHP beamwidth is well defined, the location of an isolated source is 
determined to an accuracy given by the FWHP divided by the S/N ratio. 
Thus, it is possible to determine positions to small fractions of the FWHP 
beamwidth if noise is the only limit. 

Substituting ([55]) into ([M]) it is easy to see that the maximum directive 
gain Gmax or directivity V can be expressed as 



— Gmax — 



An 



(58) 



The angular extent of the main beam is usually described by the half power 
beam width (HPBW), which is the angle between points of the main beam 
where the normalized power pattern falls to 1/2 of the maximum. For cl- 
liptically shaped main beams, values for widths in orthogonal directions are 
needed. The beamwidth is related to the geometric size of the antenna and 
the wavelength used; the exact beamsize depends on details of illumination. 



5.4 Effective Area 

Let a plane wave with the power density | (S) \ be intercepted by an antenna. 
A certain amount of power is then extracted by the antenna from this wave; 
let this amount of power be Pc- We will then call the fraction 

Ae = Pe / I {S) I (59) 

the effective aperture of the antenna. Ac has the dimension of m^ . Comparing 
this to the geometric aperture Ag we can define an aperture efficiency 77A by 



Ac = VAAg 



(60) 



Consider a receiving antenna with a normalized power pattern P^l-d^ip) 
that is pointed at a brightness distribution -B,y(??, (p) in the sky. Then at the 
output terminals of the antenna, the total power per unit bandwidth, Vi, is 
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By definition, we are in the Ray leigh- Jeans limit, and can therefore exchange 
the brightness distribution by an equivalent distribution of brightness tem- 
perature. Using the Nyquist theorem ([28|) we can introduce an equivalent 

antenna temperature Ta by 



This definition of antenna temperature relates the output of the antenna to 
the power from a matched resistor. When these two power levels are equal, 
then the antenna temperature is given by the temperature of the resistor. 
Instead of the effective aperture we can introduce the beam solid angle 
i^A- Then (pT|) becomes 



which is the convolution of the brightness temperature with the beam pat- 
tern of the telescope. The brightness temperature Tb(i?, </?) corresponds to 
the thermodynamic temperature of the radiating material only for thermal 
radiation in the Rayleigh-Jeans limit from an optically thick source; in all 
other cases Tb is only an convenient quantity that in general depends on the 
frequency. It is important to note that from (|63p . the measured size of an 
extremely compact (i. e. "point") source is the beam size. 

The quantity Ta in (|63p was obtained for an antenna with no ohmic losses, 
and no absorption in the earth's atmosphere. In the mm/sub-mm range, the 
expression Ta in (|63p is actually T^, that is, a temperature corrected for 
atmospheric losses. We will use the term in discussions of mm/sub-mm 
calibration. Since Ta is the quantity measured while Tb is the one desired, 
(j63p must be inverted. (|63p is an integral equation of the first kind, which in 
theory can be solved if the full range of Ta(i?, ^p) and Pn{^, f) are known. In 
practice this inversion is possible only approximately. Usually both Ta(i?, (/j) 
and Pn{'&, f) are known only for a limited range of and ip values, and the 
measured data are not free of errors. Therefore usually only an approximate 
deconvolution is performed. A special case is one for which the source distri- 
bution Tb{^, (fi) has a small extent compared to the telescope beam. Given a 
finite signal-to-noise ratio, the best estimate for the upper limit to the actual 
FWHP source size is one-half of the FWHP of the telescope beam. This point 
cannot be emphasized too much: we cannot assign an arbitrarily small size 
to a source. The best is one- half of the antenna FWHP! 

5.5 Antenna Feed Horns Used Today 



fcTA. 



(62) 



(63) 



Feed horns are needed to guide the power from the reflector (in free space con- 
ditions) into the receiver (in a waveguide); details are contained in [16j.[30j. 
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The electric and magnetic field strengths at the open face of a wave guide 
will vary across the aperture. The power pattern of this radiation depends 
both on the dimension of the wave guide in units of the wavelength, A, and 
on the mode of the wave. The greater the dimension of the wave guide in A, 
the greater is the directivity of this power pattern. However, the larger the 
cross-section of a wave guide in terms of the wavelength, the more difficult it 
becomes to restrict the wave to a single mode. Thus wave guides of a given 
size can be used only for a limited frequency range. The aperture required 
for a selected directivity is then obtained by flaring the sides of a section of 
the wave guide so that the wave guide becomes a horn. 

Great advances in the design of feeds have been made since 1960, and most 
parabolic dish antennas now use hybrid mode feeds. Such "corregated horns" 
are also referred to as Scalar or Multi-Mode feeds. Today such feed horns are 
used on all parabolic antennas. These provide much higher efficiencies than 
simple single mode horns and are well suited for polarization measurements. 

5.6 Multiple Reflector Systems 

If the size of a radio telescope is more than a few hundred wavelengths, de- 
signs similar to those of optical telescopes are preferred. For such telescopes 
Cassegrain, Gregorian and Nasmyth systems have been used. In a Cassegrain 
system, a convex hyperbolic reflector is introduced into the converging beam 
immediately in front of the prime focus. This reflector transfers the converg- 
ing rays to a secondary focus which, in most practical systems is situated 
close to the apex of the main dish. A Gregorian system makes use of a con- 
cave reflector with an elliptical profile. This must be positioned behind the 
prime focus in the diverging beam. In the Nasmyth system this secondary 
focus is situated in the elevation axis of the telescope by introducing another, 
usually flat, mirror. The advantage of a Nasmyth system is that the receiver 
front ends remain horizontal while when the telescope is pointed toward dif- 
ferent elevations. This is an advantage for receivers cooled with liquid helium, 
which may become unstable when tipped. Cassegrain and Nasmyth foci are 
commonly used in the mm/sub- mm wavelength ranges. 

In a secondary reflector system, feed illumination beyond the edge receives 
radiation from the sky, which has a temperature of only a few K. For low-noise 
systems, this results in only a small overall system noise temperature. This 
is signiflcantly less than for prime focus systems. This is quantifled in the so- 
called "G/T value", that is, the ratio of antenna gain of to system noise. Any 
telescope design must aim to minimize the excess noise at the receiver input 
while maximizing gain. For a speciflc antenna, this maximization involves the 
design of feeds and the choice of foci. 

That the secondary reflector blocks the central parts in the main dish from 
reflecting the incoming radiation causes some interesting differences between 
the actual beam pattern from that of an unobstructed telescope. Modern 
designs seek to minimize blockage due to the support legs and subreflector. 



Millimeter/Sub-Millimeter Astronomy 31 



/I i\ 

* I I y 

II \. 
1 1 > 



Vi 

A I 

4 \\ 



I. I 



Fig. 6. The geometry of (a) Cassegrain, (b) Gregorian, (c) Nasmytli and (d) offset 
Cassegrain systems (from [38)1. 



Realistic filled aperture antennas ("single dishes") will have a beam pat- 
tern different from a uniformly illuminated unblocked aperture. First the 
illumination of the reflector will not be uniform but has a taper by 10 dB or 
more at the edge of the reflector. The side-lobe level is strongly influenced 
by this taper: a larger taper lowers the sidelobe level. Second, the secondary 
reflector must be supported by three or four support legs, which will produce 
aperture blocking and thus affect the shape of the beam pattern. In particular 
feed leg blockage will cause deviations from circular symmetry. For altitude- 
azimuth telescopes these side lobes will change position on the sky with hour 
angle. This may be a serious defect, since these effects will be significant for 
maps of low intensity regions if the main lobe is near an intense source. The 
side lobe response can also dependent on the polarization of the incoming 
radiation. 

A disadvantage of on-axis systems, regardless of focus, is that they are 
often more susceptible to instrumental frequency baselines, so-called baseline 
ripples across the receiver band than primary focus systems. Part of this 
ripple is caused by multiple reflections of noise from source or receiver in 
the antenna structure. Ripples can arise in the reciever, but these can be 
removed or compensated rather easily. Telescope baseline ripples are more 
difficult to eliminate: it is known that large amounts of blockage and larger 
feed sizes lead to large baseline ripples. The effect is discussed in somewhat 
more detail in Sect. 16. 41 The influence of baseline ripples on measurements 
can be reduced to a limited extent by appropriate observing procedures. A 
possible solution is the construction of off-axis systems such as the GBT. 

5.7 Antenna Tolerance Theory 

It is convenient to distinguish several different kinds of phase errors in the 
current distribution across the aperture of a two-dimensional antenna. 
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If the correlation distance d is of the same order of magnitude as the 
diameter of the reflector, part of the phase error can be treated as a systematic 
phase variation, either a hnear error resulting only in a tilt of the main beam, 
or in a quadratic phase error which could be largely eliminated by refocussing. 
For d <C the phase errors are almost independently distributed across the 
aperture, while for intermediate cases according to a good estimate for the 
expected value of the RMS phase error is given by: 



1-exp^-- 



(64) 



where A is the distance between two points in the aperture that are to be 
compared and d is the correlation distance. The gain of the system now de- 
pends both on and on d. In addition, there is a complicated dependence 
both on the grading of the illumination and on the manner in which 6 is 
distributed across the aperture. The Ruzc theory can be expressed in the 
following terms: the gain of a reflector with surface phase errors can be ap- 
proximated by an expression 



G{u) = rje 
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where 



77 is the aperture efflciency, 

u — sini?, 

Ai{u) — ^ Ji(u) is the Lambda function, 

D the diameter of the reflector, and 

d the correlation distance of the phase errors. 

From are now two contributions to the beam shape of the system. The flrst is 
that of a circular aperture with a diameter D, but whose response is reduced 
due to the random phase error S. The second term is the so-called error beam. 



DIFFRACTION 




Fig. 7. A sketch showing the relative widths of the main beam and the error 
beam (shown ten times actual size) . The width of the main beam is determined by 
diffraction for the entire antenna, while that for the error beam is determined by 
scale size of the surface irregularities (from |51]). 
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This can be described as equal to the beam of a (circular) aperture with a 
diameter 2(i, its amplitude multiplied by 

The error beam contribution therefore will decrease to zero as (5 — > 0. 

The gain of a filled aperture antenna with phase irregularities d cannot 
increase indefinitely with increasing frequency but reaches a maximum at 
Am = 47re, and this gain is a factor of 2.7 below that of an error-free antenna 
of identical dimensions. Then, if the frequency can be determined at which 
the gain of a given antenna attains its maximum value, the RMS phase error 
and the surface irregularities e can be measured electrically. Experience with 
many radio telescopes shows reasonably good agreement of such values for 
£ with direct measurements, giving empirical support for the Ruze tolerance 
theory. 

6 Single Dish Observational Methods 
6.1 The Earth's Atmosphere 

For ground-based facilities, astronomical signals entering the receiver has 
been attenuated by the earth's atmosphere. In addition to attenuation, the 
receiver noise is increased by atmospheric emission, the signal is refracted and 
there are changes in the path length. Usually these effects change slowly with 
time, but there can also be rapid changes such as scintillation and anomalous 
refraction. Thus propagation properties must be taken into account, if the as- 
tronomical measurements are to be correctly interpreted. In the mm/sub-mm 
ranges, tropospheric effects are especially important. The various constituents 
of the atmosphere absorb by different amounts. Because the atmosphere can 
be considered to be in LTE, these constituents are also radio emitters. 

The total amount of precipitable water (usually measured in mm) above 
an altitude ho is an integral along the line-of-sight. Frequently, the amount 
of II2O is determined by measurements carried out at 225 GHz combined 
with models of the atmosphere. For mm/sub-mm sites, measurements of the 
183 GHz spectral line of water vapor (see Fig. fTS)) can be used to estimate 
the total amoount of H2O in the atmosphere. For sea level sites, the 22.235 
GHz line of water vapor is used for this purpose. The scale height Hn^o ~ 
2 km, is considerably less than i?air ~ 8 km of dry air. For this reason, sites 
for submillimeter radio telescopes are usually mountain sites with elevations 
above « 3000 m. 

The variation of the intensity of an extraterrestrial radio source due to 
propagation effects in the atmosphere is given by the standard relation for 
radiative transfer through a uniform medium (from Eq. l25p . 
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Tb(s) = Tb(0) e-^-^^) + T (1 - e-^-^^)) 



(66) 



Here s is the (geometric) path length along the line-of-sight with s = at 
the upper edge of the atmosphere and s = sq at the antenna. Both the 
(volume) absorption coefficient k and the gas temperature T will vary with 
s, introducing the mass absorption coefficient k^, by 



where g is the gas density; this variation of k can mainly be traced to that 
of g as long as the gas mixture remains constant along the line-of-sight. This 
is a simplified relations. For a more realistic calculations, one must use a 
multi-layer model. 

Because the variation of g with s is so much larger that that of T{s), a 
useful approximation can be obtained by introducing an effective temperature 
for the atmosphere 

Refraction effects in the atmosphere depend on the real part of the (com- 
plex) index of refraction. Except for the anomalous dispersion near water 
vapor lines and oxygen lines, it is essentially independent of frequency. The 
average effect can be calculated; fine corrections are determined from pointing 
corrections. 

A rapidly time variable effect is anomalous refraction (see [38] V If anoma- 
lous refraction is important, the apparent positions of radio sources appear 
to move away from their actual positions by up to 40" for time periods of 
30 seconds. This effect occurs more frequently in summer afternoons than 
during the night. Anomalous refraction is caused by small variations in the 
H2O content, perhaps by single cells of moist air. In the mm and sub-mm 
range, there are measurements of rapidly time variable noise contributions, 
the so-called sky noise. This is produced by variations in the water vapor 
content in the telescope beam. It does not depend in an obvious way on the 
transmission of the atmosphere. This behavior is expected if the effects arise 
within a few km above the telescope and the cells have limited sizes, sky 
noise, small telescopes (I? < 3 m) than for large telescopes 

6.2 Millimeter and Sub-mm Calibration Procedures 
6.2.1 General 

In radio astronomy, one usually follows a three step practical procedure: 
(1) the measurements must be corrected for atmospheric effects, (2) relative 
calibrations are made using secondary standards and (3) if needed, gain versus 
elevation curves for the antenna must be established. In the mm/sub-mm 
ranges, primary calibrators are, in many cases, planets or moons of planets; 
more common secondary calibrators are non-time- variable compact sources. 
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6.2.2 Calibration of mm and sub-mm Wavelength Heterodyne 
Systems 

In the mm/sub- mm wavelength range, the atmosphere has a larger influence 
and can change rapidly, so one must make accurate corrections to obtain well 
calibrated data. In the mm range, most large telescopes are close to the limits 
caused by their surface accuracy, so that the power received in the error beam 
may be comparable to that received in the main beam. Thus, one must use 
a relevant value of beam efficiency. We give an analysis of the calibration 
procedure which is standard in spectral line mm astronomy following the 
presentations in This calibration reference is referred to as the chopper 
wheel method. The procedure consists of: (1) the measurement of the receiver 
output when an ambient (room temperature) load is placed before the feed 
horn, and (2) the measurement of the receiver output, when the feed horn 
is directed toward cold sky at a certain elevation. For (1), the output of the 
receiver while measuring an ambient load, Tamb, is Vamb: 

Kmb = G (Tamb + T,^) ■ (68) 

For step (2), the load is removed; then the response to empty sky noise, Tgky 
and receiver cabin (or ground), Tgr, is 

y,ky = G [F^s Tsky + (1 - Fcff ) Tg, + T,^] . (69) 

-Fcff is referred to as the forward efficiency. This is basically the fraction of 
power in the forward beam of the feed. If we take the difference of T4mb and 
V^ky, we have 

Veal = Kimb — ^ky — G F^ff Ta,„b C"'^" , (70) 

where Ti, is the atmospheric absorption at the frequency of interest. The 
response to the signal received from the radio source, Ta, through the earth's 
atmosphere, is 

AVsig^GT^e-^- 



or 



rpl _ AVsig p rp 



where is the antenna temperature of the source outside the earth's atmo- 
sphere. We define 

= - TTT^ Jamb- (71) 

The quantity is commonly referred to as the corrected antenna tempera- 
ture, but it is really a forward beam brightness temperature. This is the Tmb 
of a source filling a large part of the sky, certainly more than 30'. 
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For sources (small compared to 30'), one must still correct for the telescope 
beam efficiency, which is commonly referred to as B^g- Then 

-Dcff 

for the IRAM 30 m telescope, F^s = 0.9 down to 1mm wavelength, but BcS 
varies with the wavelength. So at A = 3 mm, — 0.65, at 2 mm B^jf — 0.6 
and at 1.3 mm B^s — 0.45, for sources of diameter < 2'. For an object of size 
30', BcS at all these wavelength is 0.65. As usual Tmb can be considered a 
black body with the temperature Tmbj which just fills the beam. This analysis 
is the one used at IRAM. 
In terms of our notation 

■f^MB -Beff 

An antenna pointing at an elevation z to a position of empty sky will 
deliver an antenna temperature 

Ta(^) = T„ + Tatm m (1 - e-^"^(^)) + Ta„,b(l - m) , (72) 

where 

Trx : system noise temperature, 

Tatm ■ effective temperature of the atmosphere, 

Tamb ■ ambient temperature, 

771 : feed efficiency (typically rji — 0.9), 

To : zenith optical depth, 

X{z) : air mass at zenith distance z. 

These parameters can be determined by a series of calibration measure- 
ments. The efficiency r]\ and the other parameters can be determined by a 
least squares fit of (|72p. that is a sky dip giving Ta as a function of X{z). De- 
pending on the weather conditions these measurements have to be repeated 
at time intervals from 15 minutes to hours or so, to be able to detect varia- 
tions in the atmospheric conditions. At some observatories a small separate 
instrument, a taumeter (a sky horn that measures the sky temperature at 
elevations 90°, 60°, 30° and 20°) is available to determine the opacity r at 
10 minute intervals. 

For larger mm wavelength telescopes one cannot perform tipping mea- 
surement often. If a taumeter is not available one must use a more elaborate 
procedure. By measuring the response to a cold load, one can determine the 
receiver noise, and can obtain a good estimate of the noise from the atmo- 
sphere. Then, assuming a value of Tatm and rji = Fcs, one can then determine 
T ^ tq X{z), and can use this to correct for atmospheric absorption. 

To calibrate spectral lines, one frequently measures sources for which one 
has single sideband spectra. Finally observations often have to be corrected 
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for yet another effect: the telescope efficiency usually depends on elevation. 
Usually the telescope surface is set optimally for some intermediate zenith 
distance z w 40°. Both for z « 0° and 70° the efficiency usually decreases 
somewhat. 



6.2.3 Bolometer Calibrations 

Since most bolometers are A. C. coupled (i. e. responds to differences), the 
D. C. response (i. e. responds to total power) to "hot-cold" or "chopper 
wheel" calibration methods are not used. Instead astronomical data are cali- 
brated in two steps: (1) measurements of atmospheric emission to determine 
the opacities at the azimuth of the target source, and (2) the measurement 
of the response of a nearby source with a known flux density; immediately 
after this, a measurement of the target source is carried out. 



6.2.4 Compact Sources 

Usually the beam of radio telescopes are well characterized by Gaussians. 
As mentioned previously, Gaussians have the great advantage that the con- 
volution of two Gaussians is another Gaussian. For Gaussians, the relation 
between the observed source size, ^o, the beam size ^b, and actual source size, 
6's, is given by: 

el = el ^ el. (73) 

This is a completely general relation, and is widely used to deconvolve 
source from beam sizes. Even when the source shapes are not well repre- 
sented by Gaussians these are usually approximated by sums of Gaussians in 
order to have a convenient representation of the data. The accuracy of any 
determination of source size is limited by (|73p . A source whose size is less 
than 0.5 of the beam is so uncertain that one can only give as an upper limit 
of 0.5 6lb. 

If the (lossless) antenna (outside the earth's atmosphere) is pointed at a 
source of known flux density with an angular diameter that is small com- 
pared to the telescope beam, a power Wy at the receiver input terminals 

is available. Here is the antenna temperature corrected for effect of the 
earth's atmosphere. Thus 

' ' (74) 



where F is the sensitivity of the telescope measured in K Jy ^. Introducing 
the aperture efficiency rjA according to (|60p we find 



r = ?7A- 



ttD^ 
8k 



(75) 
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Thus r or can be measured with the help of a cahbrating source provided 
that the diameter D and the noise power scale in the receiving system are 
known. In practical work the inverse of relation (I74p is often used. Inserting 
numerical values we find 

5.^ 3520 . (76) 

The brightness temperature is defined as the Rayleigh- Jeans temperature of 
an equivalent black body which will give the same power per unit area per 
unit frequency interval per unit solid angle as the celestial source. Both T^. 
and Tmb arc defined in the classical limit, and not through the Planck re- 
lation. However the brightness temperature scale has been corrected for an- 
tenna cfhciency. The conversion from source flux density to source brightness 
temperature for sources with sizes small compared to the telescope beam is 
given by (Eq. [22|) : For sources small compared to the beam, the antenna and 
main beam brightness temperatures are related by the main beam efficiency, 

m- 

T' 

(77) 

J MB 

The actual source brightness temperature, Tg is related to the main beam 
brightness temperature by: 

Ts = Tmb^^^^. (78) 

Where we have made the assumption that source and beam are Gaussian 
shaped. The actual brightness temperature is a property of the source. To 
obtain this, one must determine the actual source size. This is a science driver 
for high angular resolution (i. e. interferometry) measurements. Although the 
source may not be Gaussian shaped, one normally fits multiple Gaussians to 
obtain the effective source size. 



6.2.5 Extended Sources 

For sources extended with respect to the beam, the process is vastly more 
complex, because the antenna side lobes also receive power from the celestial 
source, and a simple relation using beam efficiency is not possible without 
detailed measurements of the antenna pattern. The error beam may be a very 
significant source of calibratoin errors, particularly if the measurements are 
carried out near the limit of telescope surface accuracy. In principle t^mb could 
be computed by numerical integration of Pn{^, <p) [cf. (Eq.inU) and (Eg. [5^ ]. 
provided that Pni'djifi) could be measured for large range of d and (p. Un- 
fortunately this is not possible since nearly all astronomical sources are too 
weak; measurements of bright astronomical objects with known diameters 
can be useful. 
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If we assume a source has a uniform brightness temperature over a certain 
sohd angle ilg, then the telescope measures an antenna temperature given by 
which, for a constant brightness temperature across the source, simplifies 

to 

P„(i?,^)dl2 



Ti/ source rri 
A = ^ 



47r 



or, introducing (|55H57)) . 



= '^b'^-^^^T^ Tb = t^b/beamTb , (79) 



mam 
lobe 



where /beam is the beam fiUing factor. For gaussians 



^^^^^ - Wfoi) 

If the source diameter is of the same order of magnitude as the main beam 
the correction factor in (|79p can be determined with high precision from 
measurements of the normalized power pattern and thus (|79p gives a direct 
determination of 773 1 the beam efficiency. A convenient source with constant 
surface brightness in the long cm wavelength range is the moon whose diam- 
eter of = 30' is of the same order of magnitude as the beams of most large 
radio telescopes and whose brightness temperature 

Tb moon = 225 K (80) 

is of convenient magnitude. In the mm and submillimeter range the observed 
Moon temperature changes with Moon phase. The planets form convenient 
thermal sources with known diameters that can be used for calibration pur- 
poses p]. [4T]. 



6.3 Continuum Observing Strategies 
6.3.1 Point Sources 

These measurements are the simplest, but in the sub- mm range the earth's 
atmosphere is a large source of radiation. Compensation of transmission vari- 
ations in the atmosphere is possible if double beam systems can be used. At 
higher frequencies, in the mm/sub-mm range, the rapid movement of the 
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telescope beam (by small movements of the sub-reflector or a mirror in the 
path from receiver to antenna) over small angles, so-called "wobbling" is used 
to produce two beams on the sky from a single pixel. This is used at all large 
millimeter facilities. In the simplest system the individual telescope beams 
should be spaced by a distance of at least 3 FWHP beam widths, and the 
receiver should be switched between them. The separate beams can be im- 
plemented in different ways depending on the frequency and the technical 
facilities at the telescope. Observing procedures for a double beam system are 
usually as follows: the source is first centered on beam one, and the difference 
of the two beams is measured, optimally by wobbling the sub reflector. Then 
the source is centered on beam two, and again the difference is measured. 
This on-off method (better called on-on, because the source is always in one 
of the beams) is often arranged in a time symmetric fashion so that time 
variations of the sky noise and other instrumental effects can be eliminated. 

Multi-beam bolometer systems are now the rule. With these, one can 
measure a fairly large region simultaneously. This allows a higher mapping 
speed, and also provides a method to better cancel sky noise due to weather. 
Such weather effects are referred to as "coherent noise". Some details of more 
recent data methods are given in e. g. |35| . Usually, a wobbler system is needed 
for such arrays, since the bolometer outputs arc usually A. C. -coupled. 

6.3.2 Imaging of Extended Continuum Sources 

If extended areas are to be mapped, some kind of raster scan is employed: 
there must be reference positions at the beginning and the end of the scan. 
Usually the area is measured at least twice in orthogonal directions. After 
gridding, the differences of the images are least squares minimized to produce 
the best result. This procedure is called "basket weaving". 

Extended emission regions can also be mapped using a double beam sys- 
tem, with the receiver input periodically switched between the first and sec- 
ond beam. In this procedure, there is some suppression of very extended 
emission. A simple summation along the scan direction has been used to 
reconstruct infrared images. More sophisticated schemes can recover most, 
but not all of the information [35 . Most telescopes therefore have wobbler 
switching in azimuth to cancel ground radiation. By measuring a source using 
scans in azimuth at different hour angles, and then combining the maps one 
can recover more information ^27] . 

6.4 Additional Requirements 
for Spectral Line Observations 

In addition to the requirements placed on continuum receivers, there are three 
requirements specific to spectral line receiver systems. 
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6.4.1 Radial Velocity Settings 

If the observed frequency of a line is compared to the known rest frequency, 
the relative radial velocity of the line emitting (or absorbing) source and the 
receiving system can be determined. But this velocity contains the motion of 
the source as well as that of the receiving system. Both are measured relative 
to some standard of rest. However, usually only the motion of the source is of 
interest. Thus the velocity of the receiving system must be determined. This 
velocity can be separated into several independent components: 1) Earth 
Rotation with a maximum velocity v = 0.46 kms~^ and 2) The Motion 
of the Center of the Earth relative to the barycenter of the Solar System 
is said to be reduced to the heliocentric system. Correction algorithms are 
available for observations of the earth relative to center of mass of the solar 
system. The resulting radial velocities are then as close to an inertial system. 
Results obtained by many independent investigations show that the solar 
system moves with a velocity given by the standard solar motion. This is 
the solar motion relative to the mode of the velocity of the stars in the 
solar neighborhood. Data from which the standard solar motion has been 
eliminated are said to refer to the local standard of rest (LSR). 

6.4.2 Stability of the Frequency Bandpass 

In addition to the stability of the total power of the receiver, one must also 
have a stable shape of the receiver bandpass. At millimeter and sub-mm 
wavelengths, it is possible that changes in the weather conditions between 
on-source and reference measurements may lead to serious baseline insta- 
bilites. If so, the time betwenn on-soruce and reference measurements must 
be shortened until stable conditions are reached. Such stability is easier to 
obtain if the bandwidth of the spectrometer is narrow compared to the band- 
width of those parts of the receiver in front of the spectrometer. 

6.4.3 Instrumental Frequency Effects 

The result of any observing procedure should result in a spectrum in which 

Ta{i^) ^ for V outside the frequency range of the line. However, quite often 
this is not so because the signal response was not completely compensated 
for by a reference measurement, even if receiver stability is ideal. For larger 
bandwidths, there is an instrumental spectrum and a "baseline" must be 
subtracted from the difference spectrum. Often a linear function of frequency 
is sufficient, but sometimes some curvature is found, so that polynomials of 
second or higher order must be subtracted. 

Often a sinusoidal or quasi-periodic baseline ripple is present because 
a small fraction of the signal is reflected off obstructions in the antenna. 
This reflected signal can form a standing wave pattern. A phase change of 
2TT radians will occur if either the distance, d, over which the signals are 
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interfering is changed by A/2 (where A is the wavelength) or if the frequency 
is changed by 

Au = l^. (81) 

There are several possible sources of reflected radiation: (1) the receiver 
that injects some noise power into the antenna, part of which is then reflected 
back; or (2) strong continuum radiation from sources or the atmosphere. 
In both cases the partial reflection of the radiation in the horn aperture is 
the main cause of the instrumental "baseline ripples". Both changes in the 
position of the telescope and small changes in the receiving equipment can 
cause large changes in the amplitude of the observed ripple. Sometimes the 
amplitude of baseline ripple can be reduced considerably by installing a cone 
at the apex of the telescope that scatters the radiation forming the standing 
wave pattern. 



6.4.4 Spectral Line Observing Strategies 

In radio astronomy spectral line radiation is almost always only a small frac- 
tion of the total power received; the signal sits on a large pedestal of wide 
band noise signals contributed by different sources: the system noise, spillover 
from the antenna and in some cases, a true background noise. To avoid the 
stability problems encountered in total power systems the signal of interest 
must be compared with another signal that contains the same total power 
and differs from the first only in that it contains no line radiation. To achieve 
this aim, mm/sub-mm spcictral line observers usually make use of three ob- 
serving modes that differ only in the way the comparison signal is produced. 

1) Switching Against an Absorber Today this method is used only in 
exceptional circumstances such as for some studies of the 2.7 K cosmic mi- 
crowave background. 

2 ) Frequency Switching. For many sources, spectral line radiation is a 

narrow-band feature, that is, the; emission is centered at t'o, present over a 
small frequency interval, Av, with vq/Av k. 10~^. If all other effects vary 
very little over Av, changing the frequency of a receiver by perhaps 10 Av 
produces a comparison signal with the line shifted. If other contributions 
hardly differ, the final spectrum is proportional to the difference of these 
two measurements. Such "frequency switched" measurements can be done 
with almost any rate. These produce a particularly good compensation for 
wide-band atmospheric instabilities. Such observations can be made for mm 
wave radiation even in poor weather conditions but functions best for lines 
having widths of less than a few MHz. If the spectral line is included in 
the analyzing band in both the signal and the reference phases, the effective 
integration time is doubled. 

3) Position Switching and Wobbler Switching. The received signal 
"on source" is compared with another signal obtained at a nearby position 
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in the sky. If the emission is rather extended and the atmospheric effects 
are large (for example in the case of galactic Carbon Monoxide emission), 
one may need to use two reference measurements, one at a higher, and the 
other at a lower elevation. A number of conditions must be fulfilled: (1) the 
receiver is stable so that any gain and bandpass changes occur only over time 
scales which are long compared to the time needed for position change, and 
(2) there is little line radiation at the comparison region. If so, this method 
is efficient and produces excellent line profiles. A variant of this method is 
wobbler switching. This is very useful for compact sources, especially in the 
mm and sub-mm range. 

4) On the Fly Mapping. This very important observing method is an 
extension of method (3). In this procedure, one takes spectral line data at 
a rate of perhaps one spectrum or more per second. As with total power 
observing, usually one first takes a reference spectrum, and then takes data 
along a given direction. Then one changes the position of the telescope in the 
perpendicular direction, and repeats the procedure until the entire region is 
sampled. Because of the short integration times an entire image of perhaps 
15' X 15' taken with a 30" beam could be finished in roughly 20 minutes. 
At each position, the S/N ratio may be low, but the procedure can be re- 
peated. With each data transfer, the telescope position is read out. Even if 
there are absolute pointing errors, over this short time and small angle the 
relative positions where spectra were taken are accurate. The resulting accu- 
racy is improved because the spectra are oversampled and weather conditions 
are more uniform over the region mapped. To produce the final image the 
individual spectra are placed on a grid and then averaged. 

7 Interferometers and Aperture Synthesis 

From diffraction theory ([5^ . the angular resolution of a radio telescope is 
9 — kX/D, where 9 is the angular resolution, A is the wavelength of the 
radiation received, D is the diameter of the instrument and fc is a factor 
of order unity. For a given wavelength, to improve this angular resolution, 
the diameter D must be increased. The largest mm/sub-mm antenna is the 
Large Millimeter Telescope (LMT), with a 50 meter diameter, and a short 
wavelength limit of 0.8 mm, so 9 = 3". As shown by Michelson ([IS]), a 
resolving power 9 w A/D can be obtained by coherently combining the ouput 
of two reflectors of diameter d D separated by a distance D. 

An important extension of interferometry is aperture synthesis, that is, 
the production of high quality images of a source by combining a number of 
independent measurements in which the antenna spacings cover an equivalent 
aperture. In this chapter, we give an introduction to the principles of aperture 
synthesis. Vastly more detailed accounts with lots and lots of math are to be 
found in [48] or p^. 
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7.1 Two Element Interferometers 

The basic principles can be understood from a consideration of Fig.[8l In panel 
(a) is the response of a single uniformly illuminated aperture of diameter D. 
In panels (b) and (c) we show the response of a two element interferome- 
ter consisting of two small antennas (diameter d) separated by a distance 
D and 2D, where d <^ D. The interferometer response is obtained from the 
multiplication of the outputs of the two antennas. The uniformly illuminated 
aperture has a dominant main beam of width 9 — kX/D, accompanied by 
smaller secondary maxima, or sidelobes. There are two differences between 
the case of a single dish response compared to the case of an interferometer. 
First, for an interferometer, the nomenclature is different. Instead of 'main 
beam and sidelobes' one speaks of 'fringes'. There is a central fringe (or 'white 
light' fringe in the analogy with Young's Two Slit experiment) and adjacent 
fringes. Second, we will show that with the multiplication of the outputs of 
two antennas, the fringes are centered about zero so that the total power 
output of each antenna is supressed. Since some of the information (i. e. total 
power) is not used, for a given spacing only source structure comparable to 
(or smaller than) a fringe is recorded fully. For the case of an interferom- 
eter composed of two small dishes (with dish diameter d ^ D) there is no 
prominent main beam and the sidelobe level does not markedly decrease with 
increasing angular offset from the axes of the antennas. Comparing the width 
of the fringes in panels (b) and (c) one finds that by doubling the separation 
D of the antennas, the fringe width is halved. For the interferometer spacing 
(usually referred to as the baseline) D, in panel (b) the resolving power of the 
filled aperture is not greatly different from the single dish in panel (a), but 
the collecting area of this two element interferometer is smaller. For larger 
spacings, the interferometer angular resolution is greater. 

If a uniform source is extended in angle by a positive and a negative fringe 
in Fig. [8] the response of the multiplied output is zero. For source structure 
smaller than a fringe, the response is not diminished. Thus by increasing 
D, finer and finer source structure is measured. Combining the outputs of 
independent data sets for spacings of D and 2D shows that these select 
different structural components of the source. Finer source structure can be 
recorded if in addition, D n antenna spacings are measured. Such a series 
measurements can be made by increasing the separation of two antennas 
whose outputs are coherently combined. 

A general procedure, aperture synthesis, is now the standard method to 
obtain high quality, high angular resolution images. The first practical demon- 
stration of aperture synthesis in radio astronomy was made by Ryle and his 
associates. Aperture synthesis allows us to reproduce the imaging properties 
of a large aperture by sampling the radiation field at individual positions 
within the area contained within the aperture. In analogy with the approach 
used by Michelson in the optical wavelength range, the advance in radio as- 
tronomy was to measure the mutual coherence function and to show that 
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Fig. 8. Power patterns for different antenna configurations. The horizontal axis 
in this figure is angle. Panel (a) shows that of a uniformly illuminated full aper- 
ture with a full width to half power (FWHP) of kX/D, with fe ~ 1 . In panel 
(b) we show the power pattern of a two element multiplying interferometer con- 
sisting of two antennas of diameter d spaced by a distance D where d <^ D. 
In panel (c) we show the power pattern of the interfermeter system described in 
(b) but now with a spacing 2D. 



these results were sufficient to produce images. Using this approach, a re- 
markable improvement of the radio astronomical imaging was possible. 

Electromagnetic waves induce the voltage Ui at the output of antenna Ai 

UiocEe''^*, (82) 

while at A2 we obtain 

U2'xEe''^^'-^\ (83) 

where E is the amplitude of the incoming plane wave, r is the geometric 
delay caused by the relative orientation of the interferometer baseline B and 
the direction of the wave propagation. For simplicity, in and ([55]) we 
have neglected receiver noise and instrumental phase. The outputs will be 
correlated. Today, all interferometers use direct correlation, since the goal is 
to measure the correlation accurately. In a correlation the signals are input 
to a multiplying device followed by an integrator. The output is proportional 
to 

2 ^ 

Rir) o^jr J e''"*e dt. 
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If r is a time much longer than the time of a single full oscillation, i.e., 
T ^ 2Tr /uj then the average over time T will not differ much from the average 
over a single full period; that is 

R{t) (X ^E'^ J e'"^^ dt 



27r/w 


resulting in 

' ' (84) 



R{t) CX i^^gi^r 



The output of the correlator + integrator thus varies periodically with 
r, the delay time. If the orientation of interferometer baseline B and wave 
propagation direction s remain invariable, r remains constant, so does R{t). 
But since s is slowly changing due to the rotation of the earth, r will vary, 
and we will measure interference fringes as a function of time. 

In order to understand the response of interferometers in terms of mea- 
surable quantities, we consider a two-element system. The basic constituents 
are shown in Fig. [5] If the radio brightness distribution is given by I^{s), 
the power received per bandwidth dv from the source element df2 is 
A{s)I^{s) df2 diy, where A{s) is the effective collecting area in the direction 
s; we will assume the same A{s) for each of the antennas. The amplifiers are 
assumed to have constant gain and phase factors (neglected for simplicity). 



B s 



Tn = B Sin 



■Ai™lifler--__ \ / 



J3a . I 



..Tip 



Visibiin^ 

Fig. 9. A schematic diagram of a two-element correlation interferometer. The an- 
tenna output voltages are Vi and V2; the instrumental delay is n and the geometric 
delay is rg (from i38])- 
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The output of the correlator for radiation from the direction s (Fig. [5]) is 

ri2-^ A{s)I^{s) e''^'' dQdv (85) 

where r is the difference between the geometrical and instrumental delays Tg 
and Ti . If i? is the baseline vector for the two antennas 



Tg — Ti = - B • S — Ti 



(86) 



and the total response is obtained by integrating over the source S 



R{B) = j j A(s)/,(s)exp 




(87) 



This function R{B), the Visibility Function is closely related to the mutual 
coherence function of the source, except for the power pattern A{s) of the 
individual antennas. For parabolic antennas it is usually assumed that A(s) = 
outside the main beam area so that (|87p is integrated only over this region. 
A one dimensional version of (I87p . with a baseline B, v = vq and = 0, is 



i 27ri/o { - B-0 



Such a one dimensional relation is a very useful guide to understand inter- 
ferometer responses. 



7.1.1 Calibration 

Two quantities that must be calibrated for continuum measurements are am- 
plitude and phase. In addition, for spectral line measurements the instrument 
passband must also calibrated. 

The amplitude scale is calibrated using methods that are similar to those 
used for single dish measurements. This consists of using the response of 
each antenna to determine the system noise of the receiver being used. In the 
centimeter range, the atmosphere plays a small role while in the millimeter 
and sub-mm wavelength ranges, the atmospheric effects must be accounted 
for. For phase measurements, a suitable point-like source with an accurately 
known position is required to determine the instrumental phase 2T:vTi in 
(|87p . For interferometers, the calibration sources are usually unresolved or 
point-like sources. Most often these are extragalactic time variable sources. 
To calibrate the response in units of flux density or brightness temperatures, 
these measurements must be referenced to a thermal calibrator. 

The calibration of the instrument passband is carried out by an inte- 
gration of an intense source to determine the channel-to-channel gains and 
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offsets. The amplitude, phase and passband calibrations are carried out be- 
fore the source measurements. The passband calibration is usually carried 
out once per observing session. The amplitude and phase calibrations are 
made more often. Their frequency depends on the stability of the electronics 
and weather. At millimeter wavelengths, the calibrations are usually made 
every few minutes, but may have to be made more often in worse weather or 
at shorter wavelengths. If weather demands that frequent measurements of 
calibrators are required, this is referred to as fast switching. 



7.2 Responses of Interferometers 
7.2.1 Finite Bandvifidth 



Eq.[87] can be used to estimate the effect of a finite bandwidth Av . The 
geometric delay Tg = ■ s is by definition independent of frequency, but 
the instrumental delay t; may not be. Adjusting t; the sum t — — Ti can 
be made equal to zero for the center of the band. Introducing the relative 
phase of a wave by 



we obtain 



fractional part 



^=jB-s + ^i, (89) 

where tpi is the instrumental phase corresponding to the instrumental delay. 
This phase difference varies across the band of the interferometer Ai> by 

Aip = ^B-s—. (90) 

The fringes will disappear when Aip ~ 1 radian. As can be seen the response 
is reduced if the frequency range, that is, the bandwidth, is large compared 
to the delay caused by the separation of the antennas. For large bandwidths, 
the loss of visibility can be minimized by adjusting the phase delay the time 
difference (see Fig.[S]) is negligible. In effect, this is only possible if the ex- 
ponential term in l|87p is kept small. In practice, this is done by inserting a 
delay between the antennas so that ^ B ■ s equals Ti . In the first interfero- 
metric systems this was done by switching lengths of cable into the system; 
currently this is accomplished by first digitizing the signal after conversion to 
an intermediate frequency, and then using digital shift registers. In analogy 
with the optical wavelength range, this adjustment of cable length is equiva- 
lent to centering the response on the central, or white light fringe in Young's 
two-slit experiment. 

The reduction of the response caused by finite bandwidth can be esti- 
mated by an integration of ([87|) over frequency. Taking A{s) and /i/(s) to be 
constants, and integrating over a range of frequencies Av = vi — V2. Then 
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the result is an additional factor, sm{Ai'T) / Avt in ([ST)) . This will reduce 
the interferometer response if Aip ~ 1 . For typical bandwidths of 100 MHz, 
the offset from the zero delay must be ^ 10~* s. This adjustment of delays 
is referred to as fringe stopping. This causes the response of ([57)1 to lose a 
component. To recover this input, an extra delay of a quarter wavelength 
relative to the input of the correlator is inserted, so that the sine and cosine 
response in (|87p can be measured. In digital cross-correlators, (see Sect. l4.6|) . 
the sine and cosine components are obtained from the positive and negative 
delays. The component with even symmetry is the cosine component, while 
that with odd symmetry is the sine component. 



7.2.2 Source Size and Minimum Spacing 



Use ISHl in the following. For an idealized source, of shape /(i^o) — In for 
6 < 9o and /(I'o) = for > 9q; we take the primary beamsize of each 
antenna to be much larger, and define the fringe width for a baseline B Of, to 



be 



The result is 



RiB) ^AIo-00 exp 



sin {Tr6o/9b) 



i7:eo/0b) 



(91) 



The first terms are normalization and phase factors. The important term, in 
the second set of brackets, is a sinx/x function. If >> 0b, the interfer- 
ometer response is reduced. This is sometimes refered to as the problem of 
"missing short spacings" . 



7.2.3 Bandwidth and Beam Narrowing 



In l7.2.11 we noted that on the white light fringe the compensation must reach 
a certain accuracy to prevent a reduction in the interferometer response. 
However for a finite primary antenna beamwidth. A, this cannot be the case 
over the entire beam. For two different wavelengths A; and As, there will be 
a phase difference 



Z\0 = 27rd 



sin(6'offsot) sin (6'offsot) 



As A; 

converting the wavelengths to frequencies, and using sin 9 ^ 9, we have 



A(f> = 2tt 6'offsot - Av 
c 



With use of the relation d ■ 



we have 



(92) 
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The effect in Eq.|n2]is most important for continuum measurements made 
with large bandwidths. This effect can be reduced if the cross correlation 
is carried out using a series of contiguous IF sections. For each of these IF 
sections, an extra delay is introduced to center the response at the value 
which is appropriate for that wavelength before correlation. 

7.3 Aperture Synthesis 

Aperture Synthesis is a designation for methods used to derive the intensity 
distribution of a part of the radio sky from the measured function 

R{B). To accomplish this we must invert the integral equation ([57]) . This 
involves Fourier transforms. For even simple images, a large number of com- 
putations are needed. Thus Aperture Synthesis and digital computing are 
intimately connected. In addition, a large number of approximations to be 
applied. We will outline the most important steps of this development with- 
out, however, claiming completeness. 

For mm/sub-mm imaging, the relevant relation is: 



where I'{x,y) is the intensity I{x,y) modified by the primary beam shape 
A{x,y). One can easily correct I'{x,y) by dividing by A{x,y). 
Important definitions are: 

(1) Dynamic Range: The ratio of the maximum to the minimum intensity in 
an image. In images made with an interferometer array, it should be assumed 
tha the corrections for primary beam taper have been applied. If the mini- 
mum intensity is determined by the random noise in an image, the dynamic 
range is defined by the signal to noise ratio of the maximum feature in the 
image. The dynamic range is an indication of the ability to recognize low 
intensity features in the presence of intense features. If the minimum noise 
is determined by artefacts, i. e. noise in excess of the theoretical noise, the 
image can be improved by 'image improvement techniques'. 

(2) Image Fidelity: Thisis defined by the agreement between the measured 
results and the actual (so-called "true") source structure. A quantitiave com- 
parison would be 



where F is the fidelity, R is the resulting image obtained from the measure- 
ment, and S is the actual source structure. Of course one cannot have a priori 
knowledge of the correct source structure. In the case of simulations, S" is a 
source model, R is the result of processing S through R. 



OQ 




(93) 
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Fig. 10. The most ambitious construction project in radio astronomy is the At- 
acama Large Millimeter Array (ALMA). ALMA will be built in north Chile on a 
5 km high site. It will consist of at least fifty-four 12-m and twelve 7-m antennas, 
operating in 10 bands between wavelength 1cm and 0.3mm. The ALMA project is 
ambitious because of the superconducting receivers, the need for highly accurate 
antennas operating in the open, and the high altitude site. In addition, the data 
rates will be orders of magnitude higher than any existing astronomical facility. At 
the longest antenna spacing, and shortest wavelength, the angular resolution will 
be W5 milli arcseconds (courtesy European Southern Observatory). 

7.3.1 Interferometric Observations 

Usually measurements are carried out in 1 of 3 ways. 

- In the first procedure, measurements of the source of interest and a cali- 
brator are made. This is as in the case of single telescope position switch- 
ing. Two significant differences with single dish measurements are that 
the interferometer measurement may have to extend over a wide range of 
hour angles to provide a better coverage of the uv plane, and that instru- 
mental phase must be determined also. One first measures a calibration 
source or reference source, which has a known position and size, to remove 
the effect of instrumental phases in the instrument and atmosphere and 
to calibrate the amplitudes of the sources in question. Sources and cal- 
ibrators are usually observed alternately. The time variations caused by 
instrumental and weather effects must be slower than the time between 
measurements of source and calibrator. If, as is the case for mm/sub-mm 
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wavelength measurements, weather is an important influence, one must 
switch frequently between target source and calibration source. In fast 
switching one might spend 10 seconds on a nearby calibrator, then a few 
minutes on-source. This method will reduce the amount of phase fluctu- 
ations, but also the amount time available for source measurements. For 
more rapid changes in the earth's atmosphere, one can correct the phase 
using measurements of atmospheric water vapor, or changes in the system 
noise temperature of the individual receivers caused by atmospheric ef- 
fects. The corrections for instrumental amplitudes and phases are assumed 
to be constant over the times when the source is observed. The ratio of 
amplitudes of source and calibrator are taken to be the normalized source 
amplitudes. Since the calibrators have known flux densities and positions, 
the flux densities and positions of the sources can be determined. The 
reference source should be as close to the on-source as possible, but must 
have a large enough intensity to guarantee a good signal-to-noise ratio 
after a short time. Frequently nearby calibrators are time variable over 
months, so a more distant calibrator with a known or fixed fiux density 
is measured at the beginning or end of the session. This source is usually 
rather intense, so may also serve as a bandpass calibrator for spectral line 
measurements. The length of time spent on the off-source measurement 
is usually no more than few minutes. 

In the second procedure, the so-called snapshots, one makes a series of 
short observations (at different hour angles) of one source after another, 
and then repeats the measurements. For sensitivity reasons these are usu- 
ally made in the radio continuum or intense maser lines. As in the first 
observing method, one intersperses measurements of a calibration source 
which has a known position and size to remove the effect of instrumental 
phases in the instrument and atmosphere and to calibrate the amplitudes 
of the sources in question. The images are affected by the shape of the 
synthesized beam of the interferometer system. If the size of the source 
to be imaged is comparable to the primary beam of the individual tele- 
scopes, the power pattcirn of the primary beams will have a large effect. 
This effect can be corrected easily. 

In the third procedure, one aims to produce a high-resolution image of a 

source where the goal is cither high dynamic range or high sensitivity. The 
dynamic range is the ratio of the highest to the lowest brightness level of 
reliable detail in the image. This may depend on the signal-to-noise ratio 
for the data, but for centimeter aperture synthesis observations, spurious 
features in the image caused by the incomplete sampling of the {u, v) plane 
are usually more important than the noise. Frequently one measures the 
source in a number of different interferometer configurations to better fill 
the uv plane. These measurements are taken at different times and after 
calibration, the visibilities are entered into a common data set. 
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An extension of this procedure may involve the measurement of adjacent 
regions of the sky. This is mosaicing. In a mosaic, the primary beams 
of the telescopes should overlap, ideally this would be at the half power 
point. In the simplest case, the images are formed separately and then 
combined to produce an image of the larger region. 

In order to eliminate the loss of source flux density due to missing short 
spacings, one must supplement the interferometer data with single dish 
measurements. The diameter of the single dish telescope should be larger 
than the shortest spacing between interferometer dishes. This single dish 
image must extend to the FWHP of the smallest of the interferometer 
antennas. When Fourier transformed and appropriately combined with 
the interferometer response, this data set has no missing flux density. 
Even in a data set containng single dish data, there are "missing spacings". 
Improvements that can be applied to images produced from such data sets 
will be surveyed next. 

7.3.2 Real Time Improvements of Visibility Functions 

Ideally the relation between the measured Vik visibility and the actual visi- 
bility Vik can be considered to be linear 



Average values for the antenna gain factors gk and the noise term Sik (t) are 
determined by measuring calibration sources as frequently as possible. Ac- 
tual values for gk are then computed by linear interpolation. These methods 
make full use of the fact that the (complex) gain of the array is obtained 
by multiplication the gains of the individual antennas. If the array consists 
of n such antennas, n{n — l)/2 visibilities can be measured simultaneously, 
but only (n — 1) independent gains gk are needed (for one antenna, one can 
arbitrarily set ijfc = 1 as a reference). In an array with many antennas, the 
number of antenna pairs greatly exceeds the number of antennas. For phase, 
one must determine n phases. 

Often these conditions can be introduced into the solution in the form of 
closure errors. Introducing the phases if, 6 and ip by 



From ((94|) the visibility phase ipik on the baseline ik will be related to the 
observed phase (pik by 



V,k{t)^g,{t)gl{t)V,k+e,k{t) . 



(94) 



Vik = \Vik\ exp{i^ik} , 

Gik = \9i\\gk\ exp{i6'i}exp{-i6'fc} , 

Vik = l^ifcl exp{i?/>ifc} . 



(95) 



(fiik = ipik + Oi - 9k + Sik , 



(96) 



where Sik is the phase noise. Then the closure phase ^iki around a closed 
triangle of baseline ifc, kl, li, 
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I'iki = 'Pik + Vki + fii = -ipik + -ipki + i'u + £ik + £ki + £ii , (97) 

will be independent of the phase shifts 6 introduced by the individual an- 
tennas and the time variations. With this procedure, on can minimize phase 
errors. 

Closure amplitudes can also be formed. If four or more antennas are used 
simultaneously, then ratios, the so-called closure amplitudes, can be formed. 
These are independent of the antenna gain factors: 

" |A™||r,„| ■ ^''^ 

Both phase and closure amplitudes can be used to improve the quality of 
the complex visibility function. 

If each antenna introduces an unknown complex gain factor g with am- 
plitude and phase, the total number of unknown factors in the array can 
be reduced significantly by measuring closure phases and amplitudes. If four 
antennas are available, 50 % of the phase information and 33 % of the ampli- 
tude information can thus be recovered; in a 10 antenna configuration, these 
ratios are 80 % and 78 % respectively. 



7.3.3 Multi- Antenna Array Calibrations 

For two antenna interferometers, phase calibration can only be made pair- 
wise. This is referred to as 'baseline based' solutions for the calibration. For a 
multi-antenna system, there are other and better methods. One can use sets 
of three antennas to determine the best phase solutions and then combine 
these to optimize the solution for each antenna. For amplitudes, one can 
combine sets of four antennas to determine the best amplitude solutions and 
then optimize this solution to determine the best solution. This process leads 
to 'antenna based' solutions are used. Antenna based calibrations are used in 
most cases. These are determined by applying phase and amplitude closure 
for subsets of antennas and then making the best fit for a given antenna. 
It is important to note that because of (|50p the output of an antenna can 
be amplified without seriously degrading the signal-to-noise ratio. For this 
reason, cross-correlations between a large number of antennas is possible in 
the radio and mm/sub- mm ranges. This is not the case in the optical or 
near-IR ranges. 



7.4 Interferometer Sensitivity 

The random noise limit to an interferometer system is calculated following 
the method used for a single telescope [35] ■ The RMS fluctuations in antenna 
temperature are 

ATj,^^^, (99) 
yt Av 



Millimeter/Sub-Millimeter Astronomy 55 



where M is a factor of order unity used to account for extra noise from analog 
to digital conversions, digital clipping etc. If we next apply the definition of 
flux density, S^, in terms of antenna temperature for a two-element system, 
we find: 

Z\5, = 2fc^^P^, (100) 

where r is the atmospheric optical depth and Ac is the effective collecting 
area of a single telescope of diameter D. There is additional in this expres- 
sion since a multiplying interferometer does not process all of the information 
(i. e. the total power) that the antennas receive. In this case, there is an addi- 
tional factor of \/2 compared to the noise in a single dish with an equivalent 
collecting area since there is information not collected by a multiplying inter- 
ferometer. We denote the system noise corrected for atmospheric absorption 
by Tgyg — T g cxp T, in order to simplify the following equations. For an array 
of n identical telescopes, there are N — n{n — l)/2 simultaneous pair-wise 
correlations. Then the RMS variation in flux density is 

2 M k T' 

AS^ = . (101) 

Ac^j2NtAv 

This relation can be recast in the form of brightness temperature fluctuations 
using the Rayleigh- Jeans relation; 

S = 2k^^. (102) 

Then the RMS brightness temperature, due to random noise, in aperture 
synthesis images is 

2 M k A2 T' 

An = ^ . 103 

Acf2bV2NtAv 

For a Gaussian beam, J7mb = 1.133 6'^, so we can relate the RMS temper- 
ature fluctuations to observed properties of a synthesis image. The ALMA 
sensitivity calculator is to be found at 



http://w'ww. eso.org/sci/facilities/ALMA/observing/tools 



At shorter wavelengths, the RMS temperature fluctuations are lower. 
Thus, for the same collecting area and system noise, if weather changes are 
unimportant, a millimeter image should be more sensitive than an image 
made at centimeter wavelengths. If the effective collecting area remains the 
same and for a larger main beam solid angle, temperature fluctuations will 
decrease. For this reason, smoothing an image will result in a lower RMS 
noise in an image. However, if smoothing is too extreme, this process effec- 
tively leads to a decrease in collecting area; then there will be no further 
improvement in sensitivity. 

The temperature sensitivity (in Kelvins) for higher angular resolution 
is worse than for a single telescope with an equal collecting area. From the 
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Rayleigh- Jeans relation, the sensitivity in Jansky ( Jy) is fixed by the antenna 
collecting area and the receiver noise, so only the wavelength and the angular 
resolution can be varied. Thus, the increase in angular resolution is made at 
the expense of temperature sensitivity. All other effects being equal, at shorter 
wavelengths one gains in temperature sensitivity 

Compared to single dishes, interferometers have the great advantage that 
uncertainties such as pointing and beam size depend fundamentally on tim- 
ing. Such timing uncertainties can be made very small compared to all other 
uncertainties. In contrast, the single dish measurements are critically depen- 
dent on mechanical deformations of the telescope. In summary, the single 
dish results are easier to obtain, but source positions and sizes on arc second 
scales are difficult to estimate. The interferometer system has a much greater 
degree of complexity, but allows one to measure such fine details. The single 
dish system responds to the source irrespective of the relation of source to 
beam size; the correlation interferometer will not record source structures 
larger than a few fringes. 

Aperture synthesis is based on sampling the visibility function V{u, v) 
with separate antennas to provide samples in the (w, v) plane. Many configu- 
rations are possible, but the goal is the densest possible coverage of the (u, v) 
plane. If one calculates the RMS noise in a synthesis image obtained by sim- 
ply Fourier transforming the {u, v) data, one usually finds a noise level many 
times higher than that given by (|103p or ()101|) . There are various reasons 
for this. One cause is phase fiuctuations due to atmospheric or instrumen- 
tal infiuences such as LO instabilities. Another cause is due to incomplete 
sampling of the (m, v) plane. This gives rise to instrumental features, such as 
stripe-like features in the final images. Yet another systematic effect is the 
presence of grating rings around more intense sources; these are analogous 
to high side lobes in single dish diffraction patterns. Over the past 20 years, 
it has been found that these effects can be substantially reduced by software 
techniques such as CLEAN and Maximum Entropy. 

7.4.1 Post Real Time Improvements of Visibility Functions 

Before applying specialized techniques, the data must be organized in a useful 
way without lowering the signal-to-noise ratios. To speed up computations for 
inverting ([93|) one uses the Cooley-Tukey fast Fourier transform algorithm. 
In order to use the FFT in its simplest version, the visibility function must 
be placed on a regular grid with sizes that are powers of two of the sampling 
interval. Since the observed data seldom lie on such regular grids, an interpo- 
lation scheme must be used. If the measured points are randomly distributed, 
this interpolation is best carried out using a convolution procedure. 

If some of the spatial frequencies present in the intensity distribution are 
not present in the (w, v) plane data, then changing the amplitude or phase 
of the corresponding visibilities will not have any effect on the reconstructed 
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intensity distribution - these have been eliminated. The extent of this effect 
is shown by the "dirty beam". 

Expressed mathematically, if Z is an intensity distribution containing only 
the unmeasured spatial frequencies, and Pd is the dirty beam, then 

Pd «) Z = 0. 

Hence, if / is a solution of the convolution equation (|105p then so is 
where a. is any number. This shows that there is no unique solution to the 
problem. 

The solution with visibilities = for the unsampled spatial frequencies 
is usually called the principal solution, and it differs from the true intensity 
distribution by some unknown invisible or ghost distribution. It is the aim 
of image reconstruction to obtain reasonable approximations to these ghosts 
by using additional knowledge or plausible extrapolations, but there is no 
direct way to select the "best" or "correct" image from all possible images. 
The familiar linear deconvolution algorithms are not adequate and nonlinear 
techniques must be used. 

The result obtained from the gridded uv data can be Fourier transformed 
to obtain an image with a resolution corresponding to the size of the array. 
However, this may still contain artifacts caused by the details of the observ- 
ing procedure, especially the limited coverage of the (uv) plane. Therefore 
the dynamic range of such so-called dirty maps is rather small. This can be 
improved by further data analysis, as will be described next. 

If the calibrated visibility function V{u, v) is known for the full (u, v) plane 
both in amplitude and in phase, this can be used to determine the (modified) 
intensity distribution I'(x,y) by performing the Fourier transformation (|93p . 
However, in a realistic situation V{u, v) is only sampled at discrete points 
within a radius = Mmax along elliptical tracks, and in some regions of the 
{u,v) plane, V{u,v) is not measured at all. 

We can weight the visibilities by a grading function, g. Then for a discrete 
number of visibilities, we have a version of (|93p involving a summation, not 
an integral, to obtain an image via a discrete Fourier transform (DFT): 

/D(a;,y) =^5K,^'fc)^K,ffe)e-''"("'="+'"=^\ (104) 

k 

where g{u, v) is a weighting function called the grading or apodisation. To a 
large extent g{u, v) is arbitrary and can be used to change the effective beam 
shape and side lobe level. There are two widely used weighting functions: 
uniform and natural. Uniform weighting uses g{uk,Vk) = 1, while Natural 
weighting uses g{uk, gk) = l/-^s(fc), where Ns{k) is the number of data points 
within a symmetric region of the (u,w) plane. In a simple case Ns{k) would 
be a square centered on point k. Data which are naturally weighted result in 
lower angular resolution but give a better signal-to-noise ratio than uniform 
weighting. But these are only extreme cases. One can choose intermediate 
weighting schemes. These are often referred to as robust weighting (in the 
nomenclature of the AIPS data reduction package). Often the reconstructed 
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image /d may not be a particularly good representation of /', but these are 
related. In another form, (|104p is 

Ixi{x,y) ^ PT^{x,y)®l'{x,y), (105) 

where 

Pd = 5]5K,^^fc)e-''"("'="+'^'=^^ (106) 

k 

is the response to a point source. This is the point spread Junction PSF for 
the dirty beam. Thus the dirty beam can be understood as a transfer func- 
tion that distorts the image. (The dirty beam, Jb, is produced by the Fourier 
transform of a point source in the regions sampled; this is the response of the 
interferometer system to a point source). That is, the dirty map luix, y) con- 
tains only those spatial frequencies (wfc, Vk) where the visibility function has 
been measured. The sum in (|106p extends over the same positions (itfe, Vk) as 
in (|104p . and the side lobe structure of the beam depends on the distribution 
of these points. 

7.5 Advanced Image Improvement Methods 

Digital computing is clearly a crucial part of synthesis array data processing. 
A large part of the advances in radio synthesis imaging during the last 15- 
20 years relies on the progress made in the field of image restoration. In the 
following we present a few schemes that are applied to improve radio images. 
However this is by no means an exhaustive collection. 

7.5.1 Self-Calibration 

Amplitude and phase errors scatter power across the image, giving the ap- 
pearance of enhanced noise. Quite often this problem can be alleviated to 
an impressive extent by the method of self- calibration. This process can be 
applied if there is a sufficiently intense source in the field contained within 
the primary beam of the interferometer system. Basically, self-calibration is 
the equivalent of focusing on the source, analogous to using the focus of a 
camera to sharpen up an object in the field of view. One can restrict the 
self-calibration to an improvement of phase alone or to both phase and am- 
plitude. However, self-calibration is carried in the {u, v) plane. If properly 
used, this method leads to a great improvement in interferometer images 
of compact intense sources such as those of masering spectral lines. If this 
method is used on objects with low signal-to-noise ratios, this method may 
give very wrong results by concentrating random noise into one part of the 
interferometer image (see [S]). 

In measurements of weak spectral lines, the self-calibration is carried out 
with a continuum source in the field. The corrections are then applied to the 
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spectral line data. In the case of intense lines, one of the frequency chan- 
nels containing the emission is used. If self-calibration is applied, the source 
position information is usually lost. 

7.5.2 Applying CLEAN to the Dirty Map 

CLEANing is the most commonly used technique to improve single radio 
interferometer images([23|). The dirty map is a representation of the princi- 
pal solution, but with shortcomings. In addition to its inherent low dynamic 
range, the dirty map often contains features such as negative intensity arti- 
facts. These cannot be real. Another unsatisfactory aspect is that the prin- 
cipal solution is quite often rather unstable, in that it can change drastically 
when more visibility data arc added. Instead of a principle solution that as- 
sumes F = for all unmeasured visibilities, values for V should be adopted 
at these positions in the (u, v) plane. These are obtained from some plausible 
model for the intensity distribution. 

The CLEAN method approximates the actual but unknown intensity dis- 
tribution /(x, y) by the superposition of a finite number of point sources with 
positive intensity Ai placed at positions {xi,yi). It is the aim of CLEAN to 
determine the Ai{xi,yi) such that 

I"{x,y)^J2'^^PD{x-x^,y~yi)+Ie{x,y) (107) 

i 

where /" is the dirty map obtained from the inversion of the visibility function 
and Pd is the dirty beam (|106p . Ie{x, y) is the residual brightness distribution 
after decomposition. Approximation (|107p is deemed successful if is of the 
order of the noise in the measured intensities. This decomposition cannot be 
done analytically, rather an iterative technique has to be used. 

The CLEAN algorithm is most commonly applied in the image plane. This 
is an iterative method which functions in the following fashion: First find the 
peak intensity of the dirty image, then subtract a fraction 7 with the shape 
of the dirty beam from the image. Then repeat this n times. This loop gain 
< 7 < 1 helps the iteration converge, and it is usually continued until the 
intensities of the remaining peaks are below some limit. Usually the resulting 
point source model is convolved with a clean beam of Gaussian shape with a 
FWHP similar to that of the dirty beam. Whether this algorithm produces a 
realistic image, and how trustworthy the resulting point source model really 
is, are unanswered questions. 

7.5.3 Maximum Entropy Deconvolution Method (MEM) 

The Maximun Entropy Deconvolution Method (MEM) is commonly used to 
produce a single optimal image from a set of separate but contiguous images 
[19] . The problem of how to select the "best" image from many possible 
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images which all agree with the measured visibilities is solved by MEM. 
Using MEM, those values of the interpolated visibilities are selected, so that 
the resulting image is consistent with all previous relevant data. In addition, 
the MEM image has maximum smoothness. This is obtained by maximizing 
the entropy of the image. One possible definition of entropy is given by 



where li is the deconvolved intensity and Mi is a reference image incorpo- 
rating all "a priori" knowledge. In the simplest case Mi is the empty field 
Mi = const > 0, or perhaps a lower angular resolution image. 

Additional constraints might require that all measured visibilities should 
be reproduced exactly, but in the presence of noise such constraints are often 
incompatible with li > everywhere. Therefore the MEM image is usually 
constrained to fit the data such that 



has the expected value, where Vi is the measured visibility, is a visibility 
corresponding to the MEM image and ai is the error of the measurement. 

8 Continuum Emission from mm/sub-mm Sources 

In the early days of mm/sub-mm wavelength astronomy receiver sensitivities 
restricted measurements to a few continuum sources. This has improved dra- 
matically with the use of semiconductor bolometers pioneered by F. J. Low. 
Subsequently spectral hues of molecules such as CO, HCN and CS were found. 
These were rather intense, and this led to a blossoming of the field. Subse- 
quently, spectral lines of neutral carbon and a line of ionized carbon were 
detected. We sketch continuum radiation mechanisms and then spectral line 
mechanisms that are specific to the mm/sub-mm wavelength range. 

Radio sources can thus be classified into two categories: those which ra- 
diate by thermal mechanisms and the others, which radiate by nonthermal 
processes. In practice, non-thermal radiation can be explained by the syn- 
chrotron process. This is caused by relativistic electrons spiraling in a mag- 
netic field (see f42] for a delightful and instructive account). For optically 
thin synchrotron radiation, most sources have a flux density that varies as 




(108) 




(109) 




(110) 



where a has a positive value. Synchrotron radiation often shows linear polar- 
ization, and in rare cases, shows circular polarization. The most prominent 
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example of a synchrotron source with a = 0, i. e. a flat spectrum is Sgr A*. 
This source, at 8.5 kpc from the Sun, is considered to be the closest super- 
massive Black Hole. The variation of flux density with frequency may be 
an indication that the synchrotron emission is optically thick, and/or has a 
non-standard geometry (see, e. g. [32]). Interferometry at 1.3 mm of Sgr A* 
with baselines of up to a few lO'^ km is ongoing. The imaging of Sgr A* with 
astronomical unit linear resolution is of high interest. Measurements of syn- 
chrotron emission give only a very limited set of source parameters. When 
combined with other data, synchrotron measurements can be used to obtain 
source parameters (see [53]). 

The most famous example of black body radiation is the 2.73 K cosmic 
microwave background, CMB. This source of radiation is fit by a Planck 
curve to better than 0.1%. The difficulty in detecting it was not due to 
weak signals, but rather due to the fact that the radiation is present in all 
directions, so that scanning a telescope over the sky and taking differences will 
not lead to a detection. The actual discovery was gotten from measurements 
of the noise temperature from the sky, the receiver and the ground, compared 
to the temperature of a helium cooled load using Dicke switching. Studies 
of the CMB are conducted from satellites and balloons; these are directed 
at determinations of the polarization and deviations from the Black Body 
spectrum. 

As an example of thermal radiation, we consider planets. These are black 
bodies with spectra that are an almost exact representation of the Rayleigh- 
Jeans law for various temperatures. For Hil regions such as Orion A the 
spectrum is not a simple black body, but explanation is fairly straightforward. 
If we consider the solution of the equation of radiation transfer ([14)) for an 
isothermal object without a background source 

= S,(T)(1- e-^-), 

we find that < B^, if < 1; the frequency variation of depends on 




(HI) 

can be derived. The correction a{v,T) is usually ^ 1. The term EM is 

EM = I dl 



with units cm~^pc. This is a complex relation and was derived only after 
plowing through lots and lots of math. cannot be directly obtained from 
EM because of clumping. 

It appears that cold dust, with temperatures 10 K to 30 K, makes up much 
of the mass of dust and by implication traces cold interstellar gas, given a 
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dust-to-gas ratio. Since the average size is thought to be ~0.3^ni, the wave- 
length of the radiation is much longer than the size of the emitter. For this 
reason, the efficiency of sub-mm radiation is rather low. Dust grains show lin- 
ear polarization, which leads to the conclusion that grains are elongated and 
aligned by magnetic fields. The direction but not the strength of the magnetic 
field can be determined from dust polarization. The fractional polarization is 
rather small, so requires high sensitivity and care to keep instrumental effects 
small [13]. 

If we use the exact relation, we have 

TbM^Tof }^ ^^^Vi_e— (112) 

Vexp{To/Tdust} - 1 exp{To/2.7} - 1/ 

where Tq = hv/k. This is completely general; if we neglect the 2.7 K back- 
ground, 
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Fig. 11. The spectrum of the star burst galaxy M82. The flux density scale must 
be multiplied by 10'^. This galaxy is rather close to the Sun, but has a star formation 
rate that is much higher than for the Milky Way. The large featureless maximum 
is caused by dust radiation. On top of this are various molecular and atomic lines. 
The different curves represent M82 for the labelled redshifts. Although the emission 
decreases inversely with the square of the distance, this is compensated by the shift 
of dust radiation to lower frequencies. These effects allow one to detect an oject 
such as M82 to redshift z = 12 with the Atacama Large Millimeter Array (ALMA). 
The ALMA receiver bands are indicated by numbers in the upper left side of this 
figure (P. Cox unpublished). 
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Tdust - To ( ) (1 - e-^--) . (113) 

Vexp{To/Tdust} - 1/ 

If Tdust ^ To, we can simplify this expression, but the most important step 
is in making a quantitative connection between Tdust and N^^. The relation 
between Tdust and the gas column density must be determined empirically. 
Unlike the planets, which have measured sizes, the radiation from dust grains 
depends on the surface area of the grains, which cannot be determined di- 
rectly. If a relation between dust mass and r can be determined, it is simple to 
convert to the total mass, since dust is generally accepted to be between 1/100 
and 1/150 th of the total mass. All astronomical determinations are based on 
[22]. One typical parameterized version is given by [M]: A > 100|a,m: 

Tdust = 7x 10-21^ &A^hA-2 (114) 

where A is the wavelength in )im, Nn is in units cm^^, Z is the metalicity as 
a ratio of that of the sun Zq. The parameter b is an adjustable factor used to 
take into account changes in grain sizes. Currently, it is believed that b = 1.9 
is appropriate for moderate density gas and b = 3.4 for dense gas (but this is 
not certain). At long millimeter wavelengths, a number of observations have 
shown that the optical depth of such radiation is small. Then the observed 
temperature is 

T = Tdust Tdust , (115) 

where the quantities on the right side are the dust temperature and optical 
depth. Then the flux density is 

5= ^ -2fcrdustA-Vdust/\^2. (116) 

If the dust radiation is expressed in Jy, the source in FWHP sizes, 9 in 
arc seconds, and the wavelength, A in )im, one has for the column density of 
hydrogen in all forms, A^Hi in the Ray leigh- Jeans approximation, the following 
relation: 

If the dust radiation is expressed in mJy, the source FWHP size, 9, in arc 
seconds, and the wavelength, A in mm, the column density of hydrogen in all 
forms, A'h, in the Rayleigh-Jeans approximation, is: 

ArH^1.93xlO^^§ \' . (117) 
9-^ Z/ZQbTdust 

In the cm and mm wavelength range, the dust optical depth is small and 
increases as A^'' with (3 values between 1 and 2; then flux density increases 
as A~^ to A"**. 

Observationally, it has been determined that, in most cases, the dust 
optical depth increases with A~^; then flux density increases as A"**. Thus, 
dust emission will become more important at millimeter wavelengths and in 
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the infrared. It might appear that Eq. lllll is more accurate than Eq. 11171 
because of the vast amount of math. However for a determination of N^., 
source clumping introduces uncertainties. 



9 Spectral Line Basics 

In local thermodynamic equilibrium (LTE) the intensities of emitted and 
absorbed radiation arc not independent but arc related by Kirchhoff's law 
This applies to both continuous radiation and line radiation. The Einstein 
coefficients give a convenient means to describe the interaction of radiation 
with matter by the emission and absorption of photons [40] .These are: 



.91 Bi2 — (?2 B21 



(118) 



and 



A21 = 



■ B21 



(119) 



9.1 Radiative Transfer with Einstein Coefficients 



When the radiative transfer was considered in Sect. 12. 11 the material proper- 
ties were expressed as the emission coefHcicnt and the absorption coeffi- 
cient K,^. Both and Ku are macroscopic parameters; for a physical theory 
these must to be related to atomic properties of the matter in the cavity. If 
line radiation is considered, the Einstein coefficients are very useful because 
these can be linked directly to the properties of the transition responsible 
for the spectral line. For radiative transfer and needed, so we must 

investigate the relation between k^, and Aik and Bik ■ This is best done by con- 
sidering the possible change of intensity I^, passing through a slab of material 
with thickness ds as in Sect. 12.11 Now we will use Aik and Bik. 

According to Einstein there are three different processes contributing to 
the intensity . Each system making a transition from E2 to Ei contributes 
the energy hv^ distributed over the full solid angle Att. Then the total amount 
of energy emitted spontaneously is 




Fig. 12. Transitions between the states 1 
and 2 and the Einstein probabilities 
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For the total energy absorbed we similarly obtain 

47r Afl 
AE^{v) = hvQ Ni B12 — h ^e.{v) AV— dv dt 
c 47r 

and for the stimulated emission 

4tt d^2 
dE,{v) = hvo N2 B21 — h fcM dV— dpdt . 

C 477 



(120) 



(121) 



(122) 



The line profiles fa.{i^) and ipc{i^) for absorbed and emitted radiation could be 
different, but in astrophysics it is usually permissible to put (pst{v) — 'fc{i^) — 
ip{v). For the volume element we put dV = dcrds, where da is the unit area 
perpendicular to the beam direction. For a stationary situation, we find 



dEc{u) + dEs{i^) - dEi,{i^) ^ dl^dndadi^dt 



in 



An An 
N2 A21 + N2 B21 —h~Nx B12 — h 
c c 



ip{v) dQ da ds dv dt . 



The resulting equation of transfer with Einstein coefficients is 



^ ^ (7V^ B,^ - N2 B21) h fiiy) + ^N2 A2, v{v) 

as c An 



Comparing this with ([H]) we obtain agreement by putting 



(123) 



giN2 
92N1 



(124) 



and 



= ^2 A21 (p{v) 
An 



(125) 



The factor in brackets in ()124p is the correction for stimulated emission. In 
radio astronomy, where the stimulated emission almost completely cancels 
the effect of the true absorption, this is important. How this comes about is 
best seen if we investigate what becomes of (|123l - |125")) if LTE is assumed. 
From p24)) and p25)) we find that 



^ 2^ fg2Ni 



But for LTE, according to ([9]), this should be equal to the Planck function, 
resulting in 
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92 / 


hvo\ 




= — exp 
91 \ 





(126) 



In LTE, the energy levels are populated according to the same Boltzmann dis- 
tribution for the temperature T that applied to full TE. Then the absorption 
coefficient becomes 



Ki, = 2 ~ ^1 ^21 

Stt gi 




(127) 



where we have replaced the B coefficient by the A coefficient, using 



D12 — — A21 



9.2 Dipole Transition Probabilities 

The simplest sources for electromagnetic radiation are oscillating dipoles. 
Radiating electric dipoles have already been treated classically, but it should 
also be possible to express these results in terms of the Einstein coefficients. 
There are two types of dipoles that can be treated by quite similar means: 
the electric and the magnetic dipole. 



Electric Dipole. Consider an oscillating electric dipole 

d(t) = e x{t) = e xq cos cot . 



(128) 



According to electromagnetic theory, this will radiate. The power emitted 
into a full 4tt steradian is. 



Pit) 



2 e^vjtf 

3 c3 



(129) 



Expressing x = d/e and v = x, we obtain an average power, emitted over one 
period of oscillation of 



647r4 



3 c 



3 mn 



(130) 



This mean emitted power can also be expressed in terms of the Einstein A 
coefficient: 

(P) = hVmnAmn ■ (131) 

Equating ()130p and (|131|) we obtain 



A — 



647r4 



(132) 
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This was also cited in Section 12.3.11 where 

Mmn = (133) 

is the mean electric dipole moment of the oscillator for this transition. 

Expression (|132p is applicable only to classical electric dipole oscillators, 
but is also valid for quantum systems. 



9.3 Simple Solutions of the Rate Equation 

In order to compute absorption or emission coefficients in (|124p and (|125p . 
both the Einstein coefficients and the number densities Ni and Nk must be 
known. In the case of LTE, the ratio of A'2 to Ni given by the Boltzmann 
function: 

^2 92 ( hvo\ 



Ni 51 V kTK, 

If C12 and C21 are the collision probabilities per particle (in cm"^ s~^) for 
the transitions 1 — > 2 and 2 — > 1, respectively, Tk is the kinetic temperature. 
When Tex, Tb and Tk ^ hv/k, and if we use for abbreviation 

To = ^ (135) 
k 

we have 



J. _ rp Jb^21 + J0C21 
7x^21 + T0C21 



(136) 



If radiation dominates the rate equation (C21 ^ ^21), then Tex —^Tb- If on 
the other hand colhsions dominate (C21 ^ ^21), then Tex Tk- Since Cik 
increases with increasing A'^ collisions will dominate the distribution in high- 
density situations and the excitation temperature of the line will be equal 
to the kinetic temperature. In low-density situations Tex ~^ Tb- The density 
when A21 ~ C21 ~ N*{av) is called the critical density. The smaller A21, 
the lower is TV*. 

Radiative transfer with high optical depths can be dealt with using the 
Large Velocity Gradient (LVG) approximation. The LVG approximation is 
the simplest model for such transport. In this, it is assumed that the spher- 
ically symmetric cloud possesses large scale systematic motions so that the 
velocity is a function of distance from the center of the cloud, that is, 
y = yoif/i^o)- Furthermore, the systematic velocity is much larger than the 
thermal line width. Then the photons emitted by a two level system at one 
position in the cloud can only interact with those that are nearby. Then the 
global problem of photon transport is reduced to a local problem. With LVG, 
one has a simple method to estimate the the effects of photon trapping in 
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the If we neglect the 2.7K background and use the relation Tq = hv/k^ we 
have 

To 



1 + Tk/To In 



Tij \ 



(137) 



The term (1 — exp {—'i Tij)) / Tij is caused by 'photon trapping' in the cloud. 
If Tij ^ 1, the case of interest, then Aij is replaced by Aij/rij. 



10 Line Radiation from Atoms 

Most atomic transitions give rise to spectral lines at wavelengths in the in- 
frared or shorter. With the exception of radio recombination lines, atomic 
radio lines are rare. The energy levels are described by the scheme "^^^^Lj. 
In this description, S is the total spin quantum number, and 25 + 1 is the 
multiplicity of the line, that is the number of possible spin states. L is the 
total orbital angular momentum of the system in question, and J is the to- 
tal angular momentum. For the lighter elements, the energy levels are best 
described using LS coupling. This is constructed by vectorially summing the 
orbital momenta to obtain the total i, then combining the spins of the in- 
dividual electrons to obtain S, and then vectorially combining L and S to 
obtain J . If the nucleus has a total spin, 7, this can be vectorially combined 
with J to form F. For an is olated system, all of these quantum numbers 
have a constant magnitude and also a constant projection in one direction. 
Usually the direction is arbitrarily chosen to be along the z axis, and the 
projected quantum numbers are referred to as Mp, Mj, Ml and Ms- 

We give a list of the quantum assignments together with line frequencies, 
Einstein A coefficients and critical densities in Table[T]of millimeter and sub- 
millimeter atomic lines. 

The most studied mm/sub- mm atomic lines are those of neutral carbon 
at 492 and 809 GHz. These lines arise from molecular regions that the some- 
what protected from the interstellar ultraviolet radiation. In less obscured 
regions, ionized carbon, or Cii is present. This ion has a fine structure 
line at 157}J.m and is expected to be a dominant cooling line in denser clouds. 
Although these lines might be considered as part of infrared astronomy, the 
heterodyne techniques have reached the 1.4 THz range (=200 fj,m) from the 
ground and will reach 150 /im with the Herschel satellite. The oxygen lines 
must be measured from high flying aircraft such as SOFIA or satellites. The 
following Table [35j gives a few selected atomic transitions. 



11 Emission Nebulae, Radio Recombination Lines 



The physical state of the interstellar medium varies greatly from one region 
to the next because the gas temperature depends on the local energy input. 
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Table 1. Parameters of some atomic lines 



Element and 


Transition 




Aij 1 s 


- 1 


Critical 


Notes 


ionization 










density 




state 










n* 




CI 


'Pi -' Po 


492.16 


7.93 X 10" 


-8 


5 X 10^ 


b 


CI 


'P2 -' Pi 


809.34 


2.65 X 10" 


-7 


lo-* 


b 


CII 


2 p 2 p 

-'3/2 ~ -'^1/2 


1900.54 


2.4 X 10" 


-6 


5 X 10' 


b 


01 


'Po -' Pi 


2060.07 


1.7 X 10" 


-5 


~ 4 X 10^ 


b 


01 


'Pi -' P2 


4744.77 


8.95 X 10" 


-5 


~ 3 X 10® 


a,b 


OIII 


'Pi -' Po 


3392.66 


2.6 X 10" 


-5 


~ 5 X 10^ 


a 


OIII 


'P2 -' Pi 


5785.82 


9.8 X 10" 


■5 


~ 4 X lO' 


a 


Nil 


'Pi -' Po 


1473.2 


2.1 X 10" 


-6 


~ 5 X 10^ 


a 


Nil 


'P2 -' Pi 


2459.4 


7.5 X 10" 


-6 


~ 3 X 10^ 


a 


Nlll 


2 p 2 p 
-'3/2 — 


5230.43 


4.8 X 10" 


-5 


~ 3 X 10' 


a,b 



^ ions or electrons as collision partners 
H2 as a collision partner 



There exist large, cool cloud complexes in which both dust grains and many 
different molecular species are abundant. Often new stars are born in these 
dense clouds, and since they are sources of thermal energy the stars will heat 
the gas surrounding them. If the stellar surface temperature is sufficiently 
high, most of the energy will be emitted as photons with A < 912 A. This 
radiation has sufficient energy to ionize hydrogen. Thus young, luminous stars 
embedded in gas clouds will be surrounded by emission regions in which the 
gas temperature and consequently the pressure will be much higher than in 
cooler clouds. Occasionally an ion will rccombinc with a free electron. Since 
the ionization rate is rather low, the time interval between two subsequent 
ionizations of the same atom will generally be much longer than the time for 
the electron to cascade to the ground state, and the cascading atom will emit 
recombination lines. 



11.1 Rydberg Atoms 

The behavior of Rydberg atoms in the interstellar medium can show complex 
excitation properties; such systems give an indication of the excitation effects 
one often finds with molecules [18]. When ionized hydrogen recombines at 
some level with the principal quantum number n > 1, the atom will emit 
recombination line emission on cascading down to the ground state. The 
radius of the n**^ Bohr orbit is 



(138) 
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and so for large principle quantum number n, the effective radius of the 
atom becomes exceedingly large. Systems in such states are generally called 
Rydberg atoms. Energy levels in these are quite closely spaced, and since 
pressure effects at large n caused by atomic collision may become important, 
the different lines eventually will merge. 

The frequency of the atomic lines of hydrogen-like atoms are given by the 
Rydberg formula 



Vki 



1 



1 



Z Rm I — — — 



i < k 



where 



R 



Ro 



M 



m 



(139) 



(140) 



if TO is the mass of the electron, M that of the nucleus and Z is the effective 
charge of the nucleus in units of the proton charge. For n > 100 we always 
have Z K 1 and the spectra of all atoms are quite hydrogen-like, the only 
difference being a slightly changed value of the Rydberg constant. 

Lines corresponding to the transitions n -\- \ —t n are most intense and 
are called a lines. Those for transitions n + 2 — > n are [3 lines; n + 3 — > n 
transitions are 7 lines; etc. In the identification of a line both the element and 
the principle quantum number of the lower state are given: so H91a is the 
line corresponding to the transition 92 91 of H while H 154£ corresponds 
to 158 154 of H, Hehum and Carbon (see Fig.fn]). 



ma 
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? -2 
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Fig. 13. Recombination lines in the Hll region W3 at 8.5 GHz. The most intense 
fines are H91q, HeQla, C91q:, the total integration time, t, for this spectrum is 
75 hours; even after this time, the RMS noise foUows a theoretical dependence of 
l/yft [from g] 
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All atoms with a single electron in a highly excited state are hydrogen- 
like. The radiative properties of these Rydberg atoms differ only by their 
different nuclear masses. The Einstein coefficients An^, the statistical weights 
gi and the departure coefficients bi are identical for all Rydberg atoms, if 
the electrons in the inner atomic shells are not involved. The frequencies 
of the recombination lines are slightly shifted by the reduced mass of the 
atom. If this frequency difference is expressed in terms of radial velocities 
this difference is independent of the quantum number for a given element. 

The line width of interstellar radio recombination lines is governed by 
external effects; neither the intrinsic line width nor the fine structure of the 
atomic levels has observable consequences. In normal Hii regions, evidence 
for broadening of the lines by inelastic collisions is found for N > 130, from 
the broad line wings. For < 60 the observed lincwidth is fully explainable 
by Doppler broadening. The purely thermal Doppler broadening for hydrogen 
is: 

AVi = 0.21 Jt^. 

2 

The electrons have a velocity distribution that is described very closely by 
a Maxwellian velocity distribution; long range Coulomb forces eliminate any 
deviations with a relaxation time that is exceedingly short. This distribution 
is characterized by an electron temperature and, due to the electrostatic 
forces, the protons should have a similar distribution with the same tem- 
perature. The spectral lines are observed to have Gaussian shapes. Thermal 
Doppler motions for Tc = 10* K produce a line width of 21.4kms^^, however 
a width of ~ 25kms~^ to ~ 30kms~^ is observed. Therefore it is likely that 
nonthermal motions in the gas contribute to the broadening. These motions 
are usually referred to as microturbulence. If we include this effect, the half 
width of hydrogen is generalized to 



AVi = -\/0.04576Te + . (141) 
11.2 LTE Line Intensities 

From the correspondence principle, the data for high quantum numbers can 
be computed by using classical methods, and therefore we will use for A^i 
the expression (|132p for the electric dipole. For the the dipole moment in the 
transition n + 1 — > n we put 

_ eQ„ _ h"^ 2 
where a„ = oq is the Bohr radius of hydrogen, and correspondingly 



4 

m e 
47r2 
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Substituting this expression into (jl32p we obtain, for the limit of large n 

647r6mei° 1 5.36 x 10^ , 



A. 



n-\-l.n 



3h^ c3 



(142) 



We adopt a Gaussian line shape, ifiiy). Introducing the full line width Ai> 
at half intensity points, we obtain for the value of (p at the line center We 
obtain for the optical depth in the center of a line emitted in a region with 
the emission measure for an a line 



EM 



N,{s)Np{s)ds 



Ti^ =1.92 X lO'^ 



-5/2 



cm 



EM 



pc 



cm ^ pc y \ kHz J 



(143) 



(144) 



Here we have assumed that Np{s) w Nc{s) which should be reasonable due 
to the large abundance of H and He (= 0.1 H). We always find that tl ^ 1, 
and therefore that Tl = Te tl , or 



Tl = 1.92 X 10^ 



-3/2 



EM 



cm~6 pc / \kHz / 



(145) 



For J/ > 1 GHz we find that Tc < 1 for the continuum, so that we obtain on 
dividing (|144[) by (|llip and using the Doppler relation 





f \ 


6.985 X 10^ 


" u 


1.1 


'T ' 

^ c 


-1.15 ^ 


Tc 


^^kms^i / 


a{v, To) 


[gHzJ 




K 


1 + Ar(He+)/iV(H+) 



(146) 

in this expresssion, a[v^Te = 1, and Te is the LTE electron temperature, 
denoted as T* The last factor is due to the fact that both A^h+ and N-^^+ 
contribute to — Nj^+ +N-^q+ . Typical values for the H ii region Orion A are 
at 100 GHz are iV(H+)/iV(H+) = 0.08, Tl/Tc w 1 and AVi = 25.7kms-^ 
which give a T* value of 8200 K. Equation (|146p is valid only if both the 
line and continuum radiation are optically thin. This is the case for nearly all 
sources in the mm/sub-mm range. One possible exception is the extraordinary 
case of MWC 349 ^33j which shows time- variability and strong maser action 
in the mm/sub-mm range. 



11.3 Non LTE Line Intensities 

The departure coefficients, bn, relate the true population of level, iV„, to the 
population under LTE conditions, N*, by 
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Nn = fo„ N* . (147) 

For hydrogen and helium, the 6„ factors are always < 1, since the A coefficient 
for the lower state is larger and the atom is smaller so collisions are less 
effective. For states i and k, with k > i we have &„ ^ 1 for LTE. For any 
pair of energy levels, the upper level is always overpopulated relative to the 
lower level. Since hv <^ kT this overpopulation leads to a negative excitation 
temperature. This gives rise to recombination line masering. Usually the line 
optical depth is very small, but the background continuum could be amplified. 
In H II regions, the dominant effect is a slightly lower line intensity which leads 
to a slight overestimate of the electron temperature of the H ii region. 

12 Overview of Molecular Basics 

We present the basic concepts needed to understand the radiation from 
molecules that are widespread in the Interstellar Medium (ISM) [49j . [28] . 
For linear molecules, examples are carbon monoxide, CO, SiO, N2H+; for 
symmetric top molecules, these are ammonia, NH3, CH3CN and CII3CCH 
and for asymmetric top molecules these are water vapor, H2O, formalde- 
hyde, H2CO and II2D+. Finally we present a short account of molecules with 
non-zero electronic angular momentum in the ground state, using OH as an 
example, and then present an account of methanol, CH3OII which has hin- 
dered motion. In each section, we give relations between molecular energy 
levels, column densities and local densities. Although molecular line emis- 
sion is complex, such measurements allow a determination of parameters in 
heavily obscured regions not accessible in the near-infrared or optical. 

12.1 Basic Concepts 

The structure and excitation of even the simplest molecules is vastly more 
complex than atoms. Given the complicated structure, the Schrodinger equa- 
tion of the system will be correspondingly complex, involving positions and 
moments of all constituents, both the nuclei and the electrons. Because the 
motion of the nuclei is so slow, the electrons make many cycles while the nu- 
clei move to their new positions. This separation of the nuclear and electronic 
motion in molecular quantum mechanics is called the Born-Oppenheimer ap- 
proximation. 

Transitions in a molecule can therefore be put into three different cate- 
gories according to different energies, W: 

a) electronic transitions with typical energies of a few eV - that is lines in 
the visual or UV regions of the spectrum; 

b) vibrational transitions caused by oscillations of the relative positions of 
nuclei with respect to their equilibrium positions. Typical energies are 
0.1—0.01 eV, corresponding to lines in the infrared region of the spectrum; 
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c) rotational transitions caused by the rotation of the nuclei with typical en- 
ergies of = 10~^ eV corresponding to lines in the cm and mm wavelength 
range. 

W^°^ = VK°' + W^'^ + W'°^ . (148) 

W^^^ and W^"^ are the vibrational and rotational energies of the nuclei of the 
molecule and is the energy of the electrons. Under this assumption, the 
Hamiltonian is a sum of + W^^^ + From quantum mechanics, the 

resulting wavefunction will be a product of the electronic, vibrational and 
rotational wavefunctions. 

If wc confine ourselves to the mm/sub-mm wavelength ranges, only tran- 
sitions between different rotational levels and perhaps different vibrational 
levels (e. g., rotational transitions of SiO or HC3N from vibrationally excited 
states) will be involved. This restriction results in a much simpler description 
of the molecular energy levels. Occasionally differences between geometrical 
arrangements of the nuclei result in a doubling of the energy levels. An exam- 
ple of such a case is the inversion doubling found for the Ammonia molecule. 



12.2 Rotational Spectra of Diatomic Molecules 

Because the effective radius of even a simple molecule is about 10^ times the 
radius of the nucleus of an atom, the moment of inertia 0^. of such a molecule 
is at least 10^*^ times that of an atom of the same mass. The kinetic energy 
of rotation is 

F,ot = i^ew' = jV2 0e, (149) 

where J is the angular momentum. J is a quantity that cannot be neglected 

compared with the other internal energy states of the molecule, especially if 
the observations are made in the centimeter/millimeter/sub-mm wavelength 
ranges. (Note that J is not the same as the quantum number used in atomic 
physics.) 

For a rigid molecule consisting of two nuclei A and B, the moment of 
inertia is 

0e = mAr\ + mBrQ — mrl (150) 

where 

re=rA-rB (151) 

and 

m = , (152) 

TTlA + niB 

and 



J = &eU> 



(153) 
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is the angular momentum perpendicular to the line connecting the two nuclei. 
For molecules consisting of three or more nuclei, similar, more complicated 
expressions can be obtained. 0^ will depend on the relative orientation of the 
nuclei and will in general be a (three-axial) ellipsoid. In (|153|) values of Oe 
appropriate for the direction of u will then have to be used. 

This solution of the Schrodinger equation then results in the eigenvalues 
for the rotational energy 

Erot=W{J) = ^J{J+l), (154) 

where J is the quantum number of angular momentum, which has integer 
values 



J = 0,l,2,... . 

Equation (jl54p is correct only for a molecule that is completely rigid; for a 
slightly elastic molecule, will increase with the rotational energy due to 
centrifugal stretching. (There is also the additional complication that even 
in the ground vibrational state there is still a zero point vibration; this will 
be discussed after the concept of centrifugal stretching is presented.) For 
centrifugal stretching, the rotational energy is modified to first order as: 

Erot = W{J) = — J( J + 1) - hD [J{J + i f . (155) 
Introducing the rotational constant 

and the constant for centrifugal stretching the pure rotation spectrum for 
electric dipole transitions AJ = +1 (emission) or AJ = —1 (absorption) is 
given by the following expression: 

z/( J) = i [W{J + 1) - W{J)] = 2 Be ( J + 1) - 4L> ( J + 1)3 . (157) 



Since D is positive, the observed line frequencies will be lower than those 
predicted on the basis of a perfectly rigid rotator. Typically, the size of D is 
about 10~^ of the magnitude of Be for most molecules. In Fig.[T4l we show a 
parameterized plot of the behavior of energy above ground and line frequency 
of a rigid rotor with and without the centrifugal distortion term. The function 
plotted vertically on the left, Erot/Bc, is proportional to the energy above 
the molecular ground state. This function is given by 



Erot/Bc ^ 2TTh J( J + 1) - 2nhD/Bc [J{J + if 



(158) 
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Fig. 14. A scliematic plot of rotational energy levels for a rotor. The horizontal 
bars in the upper part represent the rotational energy levels for a rigid rotator 
[right part) and one deformed by centrifugal stretching with D/Bc — 10~^ [left 
part). In fact, most molecules have D/Bc ~ 10""". The resulting line frequencies, u, 
are shown in the lower part. The numbers next to f refer to the J values [38) . 



while those on the right are given by the first term of (|158p only. Directly 
below the energy level plots is a plot of the line frequencies for a number of 
transitions with quantum number J. The deviation between rigid rotor and 
actual frequencies becomes rapidly larger with increasing J, and in the sense 
that the actual frequencies are always lower than the frequencies predicted 
on the basis of a rigid rotor model. In Fig. [15] we show plots of the energies 
above ground state for a number of diatomic and triatomic linear molecules. 

Allowed dipole radiative transitions will occur between different rotational 
states only if the molecule possesses a permanent electric dipole moment; that 
is, the molecule must be polar. Homonuclear diatomic molecules like H2, N2 
or O2 do not possess permanent electric dipole moments. Thus they cannot 
undergo allowed transitions. This is one reason why it was so difficult to 
detect these species. 

For molecules with permanent dipole moments, a classical picture of 
molecular line radiation can be used to determine the angular distribution of 
the radiation. In the plane of rotation, the dipole moment can be viewed as 
an antenna, oscillating as the molecule rotates. Classically, the acceleration of 
positive and negative charges gives rise to radiation whose frequency is that 
of the rotation frequency. For a dipole transition the most intense radiation 
occurs in the plane of rotation of the molecule. In the quantum mechanical 
model, the angular momentum is quantized, so that the radiation is emitted 
at discrete frequencies. Dipole radiative transitions occur with a change in 
the angular momentum quantum number of one unit, that is, AJ = ±1. The 
parity of the initial and final states must be opposite for dipole radiation to 
occur. 
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Fig. 15. Rotational energy levels of the vibrational ground states of some linear 
molecules which are commonly found in the interstellar medium (Taken from [53j ) . 



12.3 Hyperfine Structure in Linear Molecules 

The magnetic dipole or electric quadrupole of nuclei interact with electrons 
or other nuclei. These give rise to hyperfine structure in molecules such as 
HCN, HNC and HC3N. For example, the ^''N and Deuterium nuclei have 
spin 1=1 and thus a nonzero quadrupole moment. The hyperfine splitting 
of energy levels depends on the position of the nucleus in the molecule; the 
effect is smaller for HNC than for HCN. fn general, the effect is of order of a 
few MHz, and decreases with increasing J. For nuclei with magnetic dipole 
moments, such as "'^^C or ^^O, the hyperfine splitting is smaller. In the case 
of hyperfine structure, the total quantum number = J + / is conserved. 
Allowed transitions obey the selection rule AF = ±1, but not AF = 0^0. 

12.4 Vibrational Transitions 

If any of the nuclei of a molecule suffers a displacement from its equilibrium 
distance r^, it will on release perform an oscillation about r^. The Schrodinger 
equation for this is 



where x = r — rc and P{r) is the potential function. If we have the simple 
harmonic approximation (Fig. ll6p with the classical oscillation frequency 
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Fig. 16. Vibrational energy levels, eigenstates ( ) and probability densities (- 

for a harmonic oscillators 1381. 



LJ = 2Tny= J— = a\ — , (160) 
V m V m 

and (|159p has the eigenvalue 

W^'^ = W{v) = hu;iv + ^) (161) 

with 

v = 0,1,2,.... (162) 

The solutions ^p^^^{x) can be expressed with the help of Hermite polynomi- 
als. For the same rotational quantum numbers, lines arising from transitions 
in different vibrational states, in a harmonic potential, are separated by a 
constant frequency interval. 

For large x, the accuracy of the harmonic motion approximation is no 
longer suflicient even an empirical expression, will have to be introduced into 
(|159p . The resulting differential equation can no longer be solved analytically, 
so numerical methods have to be used. 

A molecule consisting of only two nuclei can vibrate only in one direction; 
it has only one vibrational mode. The situation is more complex for molecules 
with three or more nuclei. In this multitude of various vibrational 

modes may exist, each of which will result in its own ladder of vibrational 
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states, some of which may be degenerate. For a certain molecular vibrational 
state, there are many internal rotational states. Vibrational motions along the 
molecular axis can be and usually are hindered in the sense that these are 
subject to centrifugal forces, and thus must overcome an additional barrier. 

It is possible to have transitions between rotational energy levels in a given 
vibrationally excited state. An example is the J = 1 — rotational transition 
of the SiO molecule from the u = 0, 1 and 2 levels. The dipole moment in 
a vibrational state is usually the same as in the ground state. The dipole 
moment for a purely vibrational transition in the case of diatomic molecules 
is usually about 0.1 Debye. A more complex example is the polyatomic linear 
molecule HC3N, for which a number of transitions have been measured in 
the ISM. 



12.5 Line Intensities of Linear Molecules 

In this section, we will give the details needed to relate the observed line 
intensities to column densities of the species emitting the transition. In the 
Born-Oppenheimer approximation, the total energy can be written as a sum 
of the electronic, vibrational and rotational energies in (|148|) . In the line spec- 
trum of a molecule, transitions between electronic, vibrational and rotational 
states are possible. We will restrict the discussion to rotational transitions, 
and in a few cases to vibrational transitions. 

Computations of molecular line intensities proceed following the princi- 
ples outlined in conjunction with the Einstein coefficients. The radial part of 
molecular wavefunctions is extremely complex. For any molecular or atomic 
system, the spontaneous transition probability, in s^^, for a transition be- 
tween an upper, m, and lower, I, level is given by the Einstein A coefficient 
Aui- In the CGS system of units, A^i is given by (|132p . Inserting numerical 
values, we have: 

A„i = 1.165 X 10-" i/^l/iuiP- (163) 

The units of the line frequency are GHz, and the units of /i are Debyes 
(i.e., 1 Debye = 10~^* e.s.u.) Eg. 11631 is a completely general relation for any 
transition. The expression contains a term which depends on the integral 
over the angular part of the wavefunctions of the final and initial states; 
the radial part of the wavefunctions is contained in the value of the dipole 
moment, /i (This is usually determined from laboratory measurements). For 
dipole transitions between two rotational levels of a linear molecule, J^J+1, 
there can be either absorption or emission. For the case of absorption, for a 
dipole moment — ImjPj have: 

ImjP^m'^^ for J^J + 1 (164) 



while for emission, the expression is given by: 
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^ 2JT3 J+^^J (165) 

where is the spectral line frequency. Here, fi is the permanent electric dipole 
moment of the molecule. Table 112.51 contains parameters for a few species 
found in the interstellar medium. 



Table 2. Parameters of the commonly observed short cm/mm molecular lines 



Chemical^ 
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Molecule 
name 


Transition 




i//GHz 


e„/k'' 
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NH3 
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= (bl) 
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NH3 


para-ammonia 


(J,K) 
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— (2 2~l 


23.722634 
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X 10 


-7 


NH3 


ortho-ammonia 


(J,K) 


= (3,3) 




23 8701 30 
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2.5 


X 10 


-7 


SiO 


silicon monoxide* 


J = 1 


- 0, II = 


2 




3512 


3 


X 10 


-6 


SiO 


silicon monoxide* 


J = 1 


- 0, II = 
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43.122080 
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X 10 


-6 


SiO 


silicon monoxide 


J = 1 


- 0,v = 


Q 


^O.Ti.iOO'JO 


2 1 
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X 10 


-6 
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carbon monosulfidc 


J = 1 


- 




48 QQnQfi4 


2.4 


1 8 


X 10 


-6 


DCO''' 


deuterated formylium J — 1 


- 




72 n3Q331 


3.5 


2.2 


X 10 


-5 


SiO 


silicon monoxide* 


J = 2 


- 1, 11 = 


2 


85.640456 


3516 


2.0 


X 10 


-5 


SiO 


silicon monoxide* 


J = 2 


- 1, tl = 


1 


86.243442 


1774 


2.0 


xlO 


- = 




formylium 


J = 1 


- 




86.754294 


4.2 


3.9 


xlO 


-5 


SiO 


silicon monoxide 


J = 2 


- l,v = 





86.846998 


6.2 


2.0 


xlO 




HCN 


hydrogen cyanide 


J = 1 


-Q,F = 


2-1 


88.631847 


4.3 


2.4 


XlO 


-5 


HCO+ 


formylium 


J = 1 


- 




89.188518 


4.3 


4.2 


XlO 


-5 


HNC 


hydrogen isocyanidc 


,7=1 


-0,F = 


2-1 


90.663574 


4.3 


2.7 


XlO 


-5 


N2H+ 


diazenylium 


,7=1 


- 0, Fi 


= 2-1, 
















F 


= 3-2 




93.173809 


4.3 


3.8 


XlO 


-5 


CS 


carbon monosulfidc 


J = 2 


- 1 




97.980968 


7.1 


2.2 


XlO 


-5 


c^^o 


carbon monoxide 


J = 1 


- 




109.782182 


5.3 


6.5 


XlO 


-8 


^^co 


carbon monoxide 


J = 1 


- 




110.201370 


5.3 


6.5 


XlO 


-8 


CO 


carbon monoxide 


J = 1 


- 




115.271203 


5.5 


7.4 


XlO 


-8 


Ha^^CO 


ortho- formaldehyde 


Jk^Kc — 2l2 - 


- Ill 


137.449959 


22 


5.3 


XlO 


-5 


H2CO 


ortho- formaldehyde 


Jk^Kc — 2l2 - 


- Ill 


140.839518 


22 


5.3 


XlO 




CS 


carbon monosulfide 


J = 3 


- 2 




146.969049 


14.2 


6.1 


XlO 




c^^o 


carbon monoxide 


J = 2 


- 1 




219.560319 


15.9 


6.2 


XlO 


-7 


^^co 


carbon monoxide 


J = 2 


- 1 




220.398714 


15.9 


6.2 


XlO 


-7 


CO 


carbon monoxide 


J = 2 


- 1 




230.538001 


16.6 


7.1 


XlO 


-7 


CS 


carbon monosulfide 


J = 5 


- 4 




244.935606 


33.9 


3.0 


XlO 


-4 


HCN 


hydrogen cyanide 


J = 3 


- 2 




265.886432 


25.5 


8.5 


XlO 


-4 


HCO+ 


formylium 


J = 3 


- 2 




267.557625 


25.7 


1.4 


XlO 


-3 


HNC 


hydrogen isocyanidc 


J = 3 


- 2 




271.981067 


26.1 


9.2 


XlO 


-4 


^ If isotope not explicitly given, 

is N, ^^Si is Si,^^S is S 
^ Energy of upper level above g 


this is the most abundant variety, i.e., 
ound, in Kelvin 


is C, 


^"^O is O, 



Spontaneous transition rate, i.e., the Einstein A coefficient 
* Always found to be a mascr transition 
** Often found to be a maser transition 



After inserting (|165p into (|163p we obtain the expression for dipole emis- 
sion between two levels of a linear molecule: 



Aj = 1.165 X 10"" i/^ 



J + 1 
2J + 3 



for J + 1 ^ J 



(166) 
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where A is in units of s~^, /ij is in Debyes (i.e. 10^^ times e.s.u. values), 
and V is in GHz. This expression is vaUd for a dipole transition in a hnear 
molecule, from a level J + 1 to J. 

Inserting the expression for A in (|127p . the general relation between line 
optical depth, column density in a level / and excitation temperature, Tex, is: 



Ni = 93.5 



9i ^ 



gu Aui [1 - exp(-4.80 x IQ-^iy/T,^)] 



■ dv 



(167) 



where the units for v are GHz and the linewidths are in kms~^. n is the local 
density in units of cm~^, and N = nl is the column density, in cm~^. 

Although this expression appears simple, this is deceptive, since there is 
a dependence on Tqx. The excitation process may cause Tex to take on a wide 
range of values. If Tc-k/v ^ 4.80 x 10^^ K, the expression becomes: 



Nl 1.94 X 10' 



; gi Tgy. 

gu At^i 



■ dw 



(168) 



Values for Tex are difficult to obtain in the general case. Looking ahead a 
bit, for the J = 1 — > and J = 2 ^ 1 transitions, CO molecules are found 
to be almost always close to LTE, so it is possible to obtain estimates of 
Tk from (|172p . This result could be used in (|168p if the transition is close 
to LTE. Expression (|168p can be simplified even further if r <C 1. Then, if 
the source fills the main beam (this is the usual assumption) the following 
relation holds: 



MB 



(169) 



where the term Tmb represents the main beam brightness temperature. In 
the general case, we will use Tb, which depends on source size. Inserting this 
in (|168p . we have: 



Nl = 1.94 X 10^ ^'^ / Tedw 

gu Aui 



(170) 



In this relation, Tgx appears nowhere. Thus, for an optically thin emission 
line, excitation plays no role in determining the column density in the energy 
levels giving rise to the transition. The units are as before; the column density, 
Nl , is an average over the telescope beam. 



12.5.1 Total Column Densities of CO Under LTE Conditions 



We apply the concepts developed in the last section to carbon monoxide, 
a simple molecule that is abundant in the ISM. Microwave radiation from 
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this molecule is rather easily detectable because CO has a permanent dipole 
moment of /i — 0.112 Debye. CO is a diatomic molecule with a simple lad- 
der of rotational levels spaced such that the lowest transitions are in the 
millimeter wavelength region. A first approximation of the abundance of the 
CO molecules can be obtained by a very standard LTE analysis of the CO 
line radiation; this is also fairly realistic since the excitation of low rotational 
transitions is usually close to LTE. Stable isotopes exist for both C and O 
and several isotopic species of CO have been measured in the interstellar 



medium; among these are ^^C^^O, ^^C^^O, ^^C^^O, ^^C^^O and ^^C^^O. 



For the distribution of CO, we adopt the simplest geometry, that is, an 
isothermal slab which is much larger than the telescope beam. Then the 
solution (|25|) may be used. If we recall that a baseline is usually subtracted 
from the measured line profile, and that the 2.7 K microwave background 
radiation is present everywhere, the appropriate formula is 



where Tq — hv/k. On the right side of (|17ip there are two unknown quanti- 
ties: the excitation temperature of the line, Tox, and the optical depth, t^. If 
is known it is possible to solve for the column density iVco in the case 
of the line A = 21cm of Hi. But in the case of CO we meet the difficulty 
that lines of the most abundant isotope ^^C ^^O always seem to be optically 
thick. It is therefore not possible to derive information about the CO column 
density from this line without a model for the molecular clouds. Here we give 
an analysis based on the measurement of weaker isotope lines of CO. This 
procedure can be applied if the following assumptions are valid. 

- All molecules along the line of sight possess a uniform excitation temper- 
ature in the J = 1 — > transition. 

- The different isotopic species have the same excitation temperatures. Usu- 
ally the excitation temperature is taken to be the kinetic temperature of 
the gas, Tk- 

- The optical depth in the ^-^0^^07=1^0 line is large compared to 
unity. 

- The optical depth in a rarer isotopomer transition, such as the ^^C ^^O 
J = 1 ^ line is small compared to unity. 

- The ^^CO and CO lines are emitted from the same volume. 

Given these assumptions, we have Tex = Ik = T, where Tk is the kinetic 
temperature, which is the only parameter in the Maxwell-Boltzmann relation 
for the cloud in question. In the remainder of this section and in the following 
section we will use the expression T, since all temperatures are assumed to 
be equal. This is certainly not true in general. Usually, the molecular energy 
level populations are often characterized by at least one other temperature, 




(171) 



T 



ex- 
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In general, the lines of ^^C ^^O are optically thick. Then, in the absence 
of background continuum sources, the excitation temperature can be deter- 
mined from the appropriate of the optically thick J = 1 — line of ^^C ^^O 
at 115.271GHz: 



T = 5.5 / In 1 



5.5 



+ 0.82 



(172) 



The optical depth of the ^^C ^^^O line at 110.201 GHz is obtained by solving 
(HZH) for 



ro^^ = -ln 



13 



1 - 



5.3 



exp 



5^ 



0.16 



(173) 



Usually the total column density is the quantity of interest. To obtain 
this for CO, one must sum over all energy levels of the molecule. This can be 
carried out for the LTE case in a simple way. For non-LTE conditions, the 
calculation is considerably more complicated. For more complex situations, 
statistical equilibrium or LVG (|137p models are needed. In this section, we 
concentrate on the case of CO populations in LTE. 

For CO, there is no statistical weight factor due to spin degeneracy. In a 
level J, the degeneracy is 2J + 1. Then the fraction of the total population 
in a particular state, J, is given by: 



7V(J)/iV(total) = 1^:L±11 exp 



hB^J{J+l) 
kf 



(174) 



Z is the sum over all states, or the Partition function. If vibrationally excited 
states are not populated, Z can be expressed as: 



Z = ^ (2J+ 1) exp 



hB^J{J +1) 



kT 



(175) 



The total population, A^(total) is given by the measured column density for a 
specific level, N{J), divided by the calculated fraction of the total population 
in this level: 



iV(total) N{J) 



(2J+ 1) 



exp 



/iSe J(J+ 1) 
kf 



(176) 



This fraction is based on the assumption that all energy levels are populated 
under LTE conditions. For a temperature, T, the population will increase 
as 2 J + 1, until the energy above the ground state becomes large compared 
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to T. Then the negative exponential becomes the significant factor and the 
population will quickly decrease. If the temperature is large compared to the 
separation of energy levels, the sum can be approximated by an integral, 



kT 
'hK. 



for hB^-^kT 



(177) 



Here Bf, is the rotation constant p56p . and the molecular population is as- 
sumed to be characterized by a single temperature, T, so that the Boltzmann 
distribution can be applied. Applying (jl76p to the J — Q level, we can obtain 
the total column density of ^^CO from a measurement of the J = 1 ^ line 
of CO and ^'^CO, using the partition function of CO, from (|177p . and (|167p : 



T 



iV(total)^3^ = 3.0 X 10" 



-13 



{v) Av 



l-exp{-5.3/r} 



(178) 



It is often the case that in dense molecular clouds ^'^CO is optically thick. 
Then we should make use of an even rarer substitution, C^^O. For the J — 
1^0 line of this substitution, the expression is exactly the same as (|178p . 
For the J = 2 —t 1 line, we obtain a similar expression, using ()176|) : 



iV(total)^^o = 1-5 X 10^ 



T exp{5.3/T} / - 
t J 



.13 



(u) du 



1 -cxp{-10.6/r} 



(179) 



In both (|178p and (|179|) , the beam averaged column density of carbon monox- 
ide is in units of cm^^ the line temperatures are in Kelvin, main beam bright- 
ness temperature and the velocities, f, are in kms~^. If the value of T ^ 10.6 
or 5.3 K, the exponentials can be expanded to first order and then these re- 
lations become simpler. 

In the limit of optically thin lines, integrals involving t(v) are equal to 
the integrated line intensity J rMB(^^) du, as mentioned before. However, there 
will be a dependence on T^x in these relations because of the Partition func- 
tion. The relation T t{v) = 7mb(^^) is only approximately true. However, 
optical depth effects can be eliminated to some extent by using the approxi- 
mation 



T I t{v) Av ' 



To 



TMBiv)Av. 



(180) 



This formula is accurate to 15% for tq < 2, and it always overestimates N 
when To > 1. The formulas (fTTS]) . (fTTS]) . (fT78| and (fT79| permit an evaluation 
of the column density only under the assumption of LTE. 
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The most extensive and complete survey in the J = 1 — line of ^^COwas 
carried out by the Boston University FCRAO group [T^. This covers the 
inner part of the northern galaxy with full sampling; there are nearly 2 x 10^ 
spectra. 

For other linear molecules, the expressions for the dipole moments and 
the partition functions are similar to that for CO and the treatment is sim- 
ilar. There is one very important difference however. The simplicity in the 
treatment of the CO molecule arises because of the assumption of LTE or 
near-LTE condtions. This may not be the case for molecules such as HCN 
or CS since these species have dipole moments of order 2 to 3 Debye. Thus 
populations of high J levels (which have faster spontaneous decay rates) may 
have populations lower than predicted by LTE calculations. Such populations 
are said to be subthermal, because the excitation temperature characterizing 
the populations would be T^x < Jk. 

12.6 Symmetric Top Molecules 
12.6.1 Energy Levels 

Symmetric and asymmetric top molecules are vastly more complex than lin- 
ear molecules. The rotation of a rigid molecule with an arbitrary shape can 
be considered to be the superposition of three free rotations about the three 
principal axes of the inertial ellipsoid. Depending on the symmetry of the 
molecule these principal axes can all be different: in that case the molecule 
is an asymmetric top. If two principal axes are equal, the molecule is a sym- 
metric top. If all three principal axes are equal, it is a spherical top. In order 
to compute the angular parts of the wavefunction, the proper Hamiltonian 
operator must be solved in the Schrodinger equation and the stationary state 
eigenvalues determined. 

In general, for any rigid rotor asymmetric top molecule in a stable state, 
the total momentum J will remain constant with respect to both its absolute 
value and its direction. As is known from atomic physics, this means that both 
( J)^ and the projection of J into an arbitrary but fixed direction, for example 
Jz, remain constant. If the molecule is in addition symmetric, the projection 
of J on the axis of symmetry will be constant also. 

For the symmetric top molecule, J is inclined with respect to the axis 
of symmetry z. Then the figure axis z will precess around the direction J 
forming a constant angle with it, and the molecule will simultaneously rotate 
around the z axis with the constant angular momentum Jz . From the defini- 
tion of a symmetric top, 0x = Oy- Taking Ox = Qy = 0± and Oz = 0\\, we 
obtain a Hamiltonian operator: 




(181) 
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Its eigenvalues are: 

H.(J.A-).J(J + l)^ + if»6^(^-jl-) (182) 

where K"^ is the eigenvalue from the operator and = + Jy + is 
the eigenvalue from the operator + Jy + J^. 

The analysis of linear molecules is a subset of that for symmetric molecules. 
For linear molecules, 0\\ ^ so that 1/(2 6>||) oo. Then finite energies in 
(|182p are possible only \i K = Q. For these cases the energies are given 
by (|154p . For symmetric top molecules each eigenvalue has a multiplicity of 
2J+ 1. 

J = 0,l,2,... ii: = 0,±l,±2,...± J. (183) 

From (|182p . the energy is independent of the sign of so levels with the 
same J and absolute value of K coincide. Then levels with > are doubly 
degenerate. 

It is usual to express ^— as B, and — as C. The units of these 

47r6'j. 47r6'|| 

rotational constants, B and C are usually either MHz or GHz. Then (|182p 

becomes 

W{J,K)/h = B J{J + {C - B) . (184) 
12.6.2 Spin Statistics 

In the case of molecules containing identical nuclei, the exchange of such 
nuclei, for example by the rotation about an axis, has a spectacular effect on 
the selection rules. Usually there are no interactions between electron spin and 
rotational motion. Then the total wavefunction is the product of the spin and 
rotational wavefunctions. Under an interchange of fermions, the total wave 
function must be antisymmetric (these identical nuclei could be protons or 
have an uneven number of nucleons) . The symmetry of the spin wavefunction 
of the molecule will depend on the relative orientation of the spins. If the spin 
wavefunction is symmetric, this is the ori/io-modification of the molecule; if 
antisymmetric it is the para-modification. In thermal equilibrium in the ISM, 
collisions with the exchange of identical particles will change one modification 
into the other only very slowly, on time scales of > 10^ years. This could 
occur much more quickly on grain surfaces, or with charged particles. If the 
exchange is slow, the ortho and para modifications of a particular species 
behave like diff'erent molecules; a comparison of ortho and para populations 
might give an estimate of temperatures in the distant past, perhaps at the 
time of molecular formation. 

For the H2 molecule, the symmetry of the rotational wavefunction de- 
pends on the total angular momentum J as (— l)*^. In the J = state the 
rotational wavefunction is symmetric. However, the total wavefunction must 
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be antisymmetric since protons are fermions. Thus, the J — 0,2,4, etc., ro- 
tational levels are para-H2, while the J = 1, 3, 5, etc., are ortho-H2. Spectral 
lines can connect only one modification. In the case of H2, dipole rotational 
transitions are not allowed, but quadrupole rotational transitions (Z\ J — ±2) 
are. Thus, the 28|J,m line of H2 connects the J = 2 and J = levels of 
para-H2. Transitions between the ground and vibrational states are are also 
possible. 

Finally, as a more complex example of the relation of identical nuclei, we 
consider the case of three identical nuclei. This is the case for NH3, CH3CN 
and CH3C2H. Exchanging two of the nuclei is equivalent to a rotation by 
120°. An exchange as was used for the case of two nuclei would not, in 
general, lead to a suitable symmetry. Instead combinations of spin states 
must be used. These lead to the result that the ortho to para ratio is two 
to one if the identical nuclei are protons. That is, NH3, CII3CN or CH3C2H 
the ortho form has S{J^K) = 2, while the para form has S{J,K) = 1. In 
summary, the division of molecules with identical nuclei into ortho and para 
species determines selection rules for radiative transitions and also rules the 
for collisions (see e. g. [49]). 

12.6.3 Hyperfine Structure 

For symmetric top molecules, the simplest hyperfine spectra is found for the 
inversion doublet transitions of NH3. Since both the upper and lower levels 
have the same quantum numbers {J,K), there will be 5 groups of hyperfine 
components separated by a few MHz. Because of interactions between the 
spins of H nuclei there will be an additional splitting, within each group, of 
order a few kHz. In Table 15.1 we give the relative intensities of the NH3 
satellites for the case of low optical depth and LTE. For a molecule such as 
OH, one of the electrons is unpaired. The interaction of the nuclear magnetic 
moment with the magnetic moment of an unpaired electron is described as 
magnetic hyperfine structure. This splits a specific line into a number of 
components. In the case of the OH molecule, this interaction gives rise to 
a hyperfine splitting of the energy levels, in addition to the much larger A 
doubling. Together with the A doublet splitting, this gives rise to a quartet 
of energy levels in the OH ground state. Transitions between these energy 
levels produces the four ground state lines of OH at 18 cm wavelength (see 
Fig.[2l]). 

Table 3. Intensities of satellite groups relative to the Main Component (see [38]). 

(J,K)|(1,1) (2,2) (3,3) (4,4) (5,5) (6,6) (2,1) 

linner 0.295 0.0651 0.0300 0.0174 0.0117 0.0081 0.0651 
louter 0.238 0.0628 0.0296 0.0173 0.0114 0.0081 0.0628 
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NH3 is an example of an oblate symmetric top molecule commonly found 
in the ISM. A diagram of the lower energy levels of NH3 are shown in Fig.fTTl 
A prolate top molecule has a cigar-like shape. Then A replaces C, and A > B. 
The energy-level diagrams for prolate symmetric top molecules found in the 
ISM, such as CII3CCH and CH3CN, follow this rule. However, since these 
molecules are much heavier than NH3, the rotational transitions give rise to 
lines in the millimeter wavelength range. 

Differences in the orientation of the nuclei can be of importance. If a 
reflection of all particles about the center of mass leads to a configuration 
which cannot be obtained by a rotation of the molecule, so these reflections 
represent two different states. For NH3, we show this situation in the upper 
part of Fig.dTl Then there are two separate, degenerate states which exist 
for each value of {J,K) for J > I. (The K = ladder is an exception 
because of the Pauli principle.) These states are doubly degenerate as long 
as the potential barrier separating the two configurations is infinitely high. 
However, in molecules such as NH3 the two configurations are separated only 
by a small potential barrier. This gives rise to a measurable splitting of the 
degenerate energy levels, which is referred to as inversion doubling. For NH3, 
transitions between these inversion doublet levels are caused by the quantum 
mechanical tunneling of the nitrogen nucleus through the plane of the three 
protons. The wavefunctions of the two inversion doublet states have opposite 
parities, so that dipole transitions are possible. Thus dipole transitions can 
occur between states with the same (J, K) quantum numbers. The splitting 
of the (J, K) levels for NH3 shown in Fig. [IT] is exaggerated; the inversion 
transitions give rise to spectral lines in the wavelength range near 1 cm. For 
CH3CCH or CH3CN, the splitting caused by inversion doubling is very small 
since the barrier is much higher than for NH3. 

The direction of the dipole moment of symmetric top molecules is parallel 
to the K axis. Spectral line radiation can be emitted only by a changing dipole 
moment. Since radiation will be emitted perpendicular to the direction of the 
dipole moment, there can be no radiation along the symmetry axis. Thus the 
K quantum number cannot change in dipole radiation, so allowed dipole 
transitions cannot connect different K ladders. The different K ladders are 
connected by octopole radiative transitions which require AK = ±3. These 
are very slow, however, and collisions are far more likely to cause an exchange 
of population between different K ladders. This is used to estimate Tk from 
the ratio of populations of different (J, K) states in symmetric top molecules. 



12.6.4 Line Intensities and Column Densities 

The extension of this analysis to symmetric top molecules is only slightly 
more complex. The dipole moment for an allowed transition between energy 
level J + 1,K and J, K for a symmetric top such as CH3CN or CH3C2H is 

M'=t^' lj^^f^2J + l) iJ + l,K)^{J,K). (185) 
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Fig. 17. The energy-level diagram of the vibrational ground state of NH3, a prolate 
symmetric top molecule. Ortho-NHs has -R' = 0, 3, 6, 9, • • •, while para-NHs has all 
other K values (see text). Rotational transitions with AJ = 1, AK — 0, give rise to 
lines in the far IR. This molecule also has transitions with AJ = 0, AK — between 
inversion doublet levels. The interaction of the nuclear spin of with the electrons 
causes quadrupole hyperfine structure. In the AJ — 0, AK = transitions, the line 
is split into 5 groups of components. A sketch of the structure of the groups of 
hyperfine components of the {J,K) = (1, 1) inversion doublet line is indicated in 
the lower right; the separation is of order of MHz. In the upper right is a sketch of 
the molecule before and after an inversion transition, which gives rise to a 1.3 cm 
photons [55] . 
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For these transitions, J > K always. 

For NH3, the most commonly observed spectral lines are the inversion 
transitions at 1.3 cm between levels (J, K) and (J, K). The dipole moment is 



J(J+ 1) 



for AJ 0, Z\/C = . 



(186) 



When these relations are inserted in (jl63p , the population of a specific level 
can be calculated following (|167p . If we follow the analysis used for CO, we 
can use the LTE assumption to obtain the entire population 



iV(total) = N{J, K) 



Z 



{2J+1)S{J,K) 



exp 



W{J, K) 
kT 



(187) 



where W is the energy of the level above the ground state, and the nuclear 
spin statistics are accounted for through the factor S'( J, K) for the energy 
level corresponding to the transition measured. For symmetric top molecules, 
we have, using the expression for the energy of the level in question (|184p , 



A^(total) 



ZN{J,K) 
{2J+1)S{J,K) 



exp 



BJ{J+1) + {C-B)K'^ 



kT 



(188) 



For prolate tops, A replaces C in ()188|) . and in p89p to ()193p . If we sum 
over all energy levels, we obtain A'^(total), the partition function, Z in the 
following: 



00 K=J 



Z=J2T. {2J+l)S{J,K)exp 



J=0 K=0 



BJ{J + 1) + {C-B)K^ 



kT 



(189) 



If the temperature is large compared to the spacing between energy levels, 
one can replace the sums by integrals, so that: 



Z : 



7r(fcr)3 



(190) 



If we assume that hv <^ kT, use CGS units for the physical constants, and 
GHz for the rotational constants A, B and C, the partition function, Z, 
becomes 



Z « 168.7 
Substituting into (I189p . we have: 



168.7 



A^(total) = N{J, K) 



r3 



T^ 
B^C 



(191) 



i2J+l)S{J,K) 



exp 



W{J, K) 



kT 



(192) 
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Here, N{J,K) can be calculated from (|167p or (|168p . using the appropriate 
expressions for the dipole moment, (|186p . in the Einstein A coefficient rela- 
tion, (|163p and W is the energy of the level above the ground state. In the 
ISM, ammonia inversion lines up to (J,K)=(18,18) have been detected |52) . 

We now consider a situation in which the NH3 population is not thermal- 
ized. This is typically the case for dark dust clouds. We must use some con- 
cepts presented in the next few sections for this analysis. If n(H2)~ 10^ cm""^, 
and the infrared field intensity is small, a symmetric top molecule such as 
NH3 can have a number of excitation temperatures. The excitation temper- 
atures of the populations in doublet levels are usually between 2.7 K and Tk- 
The rotational temperature, Trot, which describes populations for metastable 
levels (J = K) in different K ladders, is usually close to Tk- This is because 
radiative transitions between states with a different K value are forbidden to 
first order. The excitation temperature which describes the populations with 
different J values within a given K ladder will be close to 2.7 K, since radia- 
tive decay with Z\K=0, A3=l is allowed. Then the non-metastable energy 
levels, (J > K), are not populated. In this case, Z is simply given by the sum 
over the populations of metastable levels: 



Z{J = K) 
j=o 



kT 



(193) 



For the NH3 molecule in dark dust clouds, where Tk = 10 K and n(Il2)= 
10^ cm~^, we can safely restrict the sum to the three lowest metastable levels: 

Z{J = K)k N{Q, 0) -I- 7V(1, 1) + N{2, 2) + 7V(3, 3) . (194) 

Substituting the values for NII3 metastable levels: 



Z{J = K) 
«iV(l,l) 



1 /23.1\ 5 / 41.2\ 14 / 99.4 



exp — — + i -f - exp - — — + — exp 



3 \ kT J 3 \ kT J 3 ' \ kT 

(195) 



For NII3 we have given two extreme situations: in the first case, described by 
(|195p . is a low-density cloud for which only the few lowest metastable levels 
are populated. The second case is the LTE relation, given in (|192p . This 
represents a cloud in which the populations of the molecule in question are 
thermalized. More complex arc those situations for which the populations of 
some of the levels are thermalized, and others not. Such a situation requires 
the use of a statistical equilibrium or an LVG model. 



92 



T. L. Wilson 




1400 - 



E 1200 

LU 

^ 1000 - 
Q 

i 800 

O 
a: 

uj 600 
> 

o 

CQ 

< 400 
>- 

o 

UJ 200 



LU 







M M M M I ni 



^62 



83_5 ^32i»fe^60 



624 ^"2 



6 06 

— 52 



- 4,, _ 



4,1 - 



404 , 
— -^2: 



313^83 GHz22l 
202— ~ 



PARA HpO ^lii 
I I I I I I I I I I I- 



iol M ' I ' I ' I ' U 

945 

- 1101 

936321 GHz1029 _ 



0_i5 IO110 
Sl8 ~ 



743 



532 



643 



625 



707 



441- 



52 3,22 GHz6 l6 

I- 4 !^r~ 



530 



'23 



321 3B0GHz4i4 

^1 ^3 

I- 2i2 



lioi^r ORTHO H2 o 
■ I I I I I I I I I I I- 



10 8642002468 10 
Ka Kc 



Fig. 18. Energy level diagrams for ortho- and para-H20. This is an asymmetric 
top molecule, with the dipole moment along the B axis, that is, the axis with an 
intermediate moment of inertia. Because of the two identical nuclei, the energy 
level diagram is split into ortho and para, that show almost no interaction under 
interstellar conditions. The transitions marked by arrows are well-known masers 
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12.7 Asymmetric Top Molecules 

12.7.1 Energy Levels 

For an asymmetric top molecule there are no internal molecular axes with a 
time-invariable component of angular momentum. So only the total angular 
momentum is conserved and we have only J as a good quantum number. The 
moments of inertia about each axis are different; the rotational constants are 
referred to as A, B and C, with A > B > C. The prolate symmetric top 
{B ~ C) or oblate symmetric top {B — A) molecules can be considered as 
the limiting cases. But neither the eigenstates nor the eigenvalues are easily 
expressed in explicit form. Each of the levels must be characterized by three 
quantum numbers. One choice is Jk^k^, where J is the total angular momen- 
tum, Ka is the component of J along the A axis and K,. is the component 
along the C axis. If the molecule is a prolate symmetric top, J and Ka are 
good quantum numbers; if the molecule were an oblate symmetric top, J and 
Kc would be good quantum numbers. Intermediate states are characterized 
by a superposition of the prolate and oblate descriptions. In Fig. [191 we show 
the energy level diagram for the lower levels of II2CO, which is almost a 
prolate symmetric top molecule with the dipole moment along the A axis. 
Since radiation must be emitted perpendicular to the direction of the dipole 
moment, for H2CO there can be no radiation emitted along the A axis, so 
the quantum number Ka will not change in radiative transitions. 

12.7.2 Spin Statistics and Selection Rules 

The case of a planar molecule with two equivalent nuclei, such as II2CO, 
shows is a striking illustration of these effects (see Fig.[T9|). The dipole mo- 
ment lies along the A axis. A rotation by 180° about this axis will change 
nothing in the molecule, but will exchange the two protons. Since the pro- 
tons are fermions, this exchange must lead to an antisymmetric wavefunction. 
Then the symmetry of the spin wave function and the wave function describ- 
ing the rotation about the A axis must be antisymmetric. The rotational 
symmetry is (—1)^". If the proton spins are parallel, that is ortho-Il2CO, 
then the wave function for Ka must be anti-symmetric, or Ka must take on 
an odd value. If the proton spins are anti-parallel, for para-H2C0, Ka must 
have an even value (Fig. [19]). For ortho-H2CO, the parallel spin case, there 
are three possible spin orientations. For para-Il2C0, there is only one pos- 
sible orientation, so the ratio of ortho-to-para states is three. Such an effect 
is taken into account in partition functions p75p by spin degeneracy factors, 
which are denoted by the symbol S{J, K). For ortho-Il2CO, 5( J, K) — 3, for 
para-H2C0, S{J,K) = 1. This concept will be applied in Sect. 1 12. 7731 

Allowed transitions can occur only between energy levels of either the 
ortho or the para species. For example, the 6cm II2CO line is emitted from 
the ortho- modification only (see Fig. [TO)) . Another example is the interstellar 
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H2O maser line at A = 1.35 cm which arises from ortho-H20 (see Fig.lTS)). 
The H2O molecule is a more complex case since the dipole moment is along 
the B axis. Then in a radiative transition, both Ka and must change 
between the initial and final state. 

12.7.3 Line Intensities and Column Densitiess 

For asymmetric molecules the moments of inertia for the three axes are all 
different; there is no symmetry, so three quantum numbers are needed to 
define an energy level. The relation between energy above the ground state 
and quantum numbers is given in Appendix IV of [49j or in databases for 
specific molecules (see figure captions for references). The relation for the 
dipole moment of a specific transition is more complex; generalizing from 
(|163p . we have, for a spontaneous transition from a higher state, denoted by 
u to a lower state, denoted by I: 



This involves a dipole moment in a direction x. As before, the units of u are 
GHz, and the units of /i are Debyes (i.e., 1 Debye = 10~^^ times the e.s.u. 
value). The value of the quantum number J' refers to the lower state. The 
expression for S{u; I), the line strength is an indication of the complexity of 
the physics of asymmetric top molecules. The expression S{u; I) is the angular 
part of the dipole moment between the initial and final state. The dipole 
moment can have a direction which is not along a single axis. In this case 
there are different values of the dipole moment along different molecular axes. 
In contrast, for symmetric top or linear molecules, there is a dipole moment 
for rotational transitions. Methods to evaluate transition probabilities for 
asymmetric molecules are discussed at length in [49]. There is a table of S{u; I) 
in their Appendix V. We give references for S{u]l) in our figure captions. 
From the expression for the Einstein A coefficient, the column density in a 
given energy level can be related to the line intensity by (|167p . Following the 
procedures used for symmetric top molecules, we can use a relation similar 
to (|189p to sum over all levels, using the appropriate energy, W, of the level 
Jk^k^ above the ground state and the factor S{Jkc,k^) for spin statistics: 



If the populations arc in LTE, one can follow a process similar to that used 
to obtain (fT9T|l . Then we obtain the appropriate expression for the partition 
function: 



A = 1.165 X 10 




S{u;l) 
2 J' + 1 



(196) 





(198) 
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When combined with the Boltzmann expression for a molecule in a specific 
energy level, this gives a simple expression for the fraction of the population 
in a specific rotational state if LTE conditions apply: 

168.7 J-j— /^x 
Ar(total) . iV( exp (-) . (199) 

In this expression, S{JKaKa) accounts for spin statistics for energy level 
jRaKoJ and A, B and C are the molecular rotational constants in GHz. 
W , the energy of the level above the ground state, and T, the temperature, 
are given in Kelvin. Given the total molecular column density and the value 
of T, the feasibility of detecting a specific line can be obtained when the 
appropriate A coefficient value is inserted into (|167|) or (|168[) . 

As pointed out in connection with NH3, T need not be Tk. In reality, 
a number of different values of T may be needed to describe the popula- 
tions. We will investigate the influence of excitation conditions on molecular 
populations and observed line intensities next. 
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Fig. 19. Energy-level diagram of formaldehyde, H2CO. This is a planar asymmetric 
top, but the asymmetry is very small. The energy-level structure is typical of an 
almost prolate symmetric top molecule. In the lower right is a sketch of the structure 
of the molecule [31] 
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Two important interstellar molecules are H2CO and H2O. Here we sum- 
marize the dipole selection rules. Rotating the molecule about the axis along 
the direction of the dipole moment, we effectively exchange two identical par- 
ticles. If these are fermions, under this exchange the total wavefunction must 
be antisymmetric. For H2CO, in Sect. I12.01 we reviewed the spin statistics. 
Since the dipole moment is along the A axis, a dipole transition must involve 
a change in the quantum numbers along the B or C axes. From Fig. 1191 the 
Ka=0 ladder is para-H2C0, so to have a total wavefunction which is anti- 
symmetric, one must have a space wavefunction which is symmetric. For a 
dipole transition, the parities of the initial and final states must have differ- 
ent parities. This is possible if the C quantum number changes. For H2O, the 
dipole moment is along the B axis, from Fig. [181 In a dipole transition, the 
quantum number for the B direction will not change. For ortho-H20, the spin 
wavefunction is symmetric, so the symmetry of the space wavefunction must 
be antisymmetric. In general, this symmetry is determined by the product of 
Ka and Kc. For ortho-Il20, this must be KaKc= (odd) (even), i.e. oe, or eo. 
For allowed transitions, one can have oe — eo or eo — oe. For para-Il20, the 
rule is 00 — ee or ee — 00. Clearly H2S follows the selection rules for II2O. 
These rules will be different for SO2 since the exchanged particles are bosons. 
More exotic are D2CO, ND3 and D2O. 
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Fig. 20. The few lowest energy levels of the H2D molecule. This is a planar, 
triangular-shaped asymmetric top |14j . 

The species H;^ has the shape of a planar triangle. It is a key to ion- 
molecule chemistry, but has no rotational transitions because of its symmetry. 
The deuterium isotopomer, H2D"*" is an asymmetric top molecule with a 
permanent dipole moment. The spectral line from the lio-lii levels ortho 
species was found at 372.421 GHz. A far infrared absorption line from the 
2i2-lii levels was also detected. We show an energy level diagram in Fig. [201 
The doubly deuterated species, D2H+,has been detected in the lio-loi line 
at 691.660 GHz from the para species (see [50]). 
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12.8 Electronic Angular Momentum 

In many respects the description of electronic angular momentum is similar to 
that of atomic fine structure as described by Russell-Saunders {L S) coupling. 
Each electronic state is designated by the symbol '^^'^^Aq, where 25' + 1 is 
the multiplicity of the state with S the electron spin and A is the projection 
of the electronic orbital angular momentum on the molecular axis in units 
of h. The molecular state is described as 2J, 7T, A etc., according to whether 
yl = 0,l,2,... . 

E is the projection of the electron spin angular momentum on the molec- 
ular axis in units of h (not to be confused with the symbol S, for A = 0). 
Finally, f2 is the total electronic angular momentum. For the Hund coupling 
case A, [2 ^ \A + Si [see e. g. [55]]. 

Since the frequencies emitted or absorbed by a molecule in the optical 
range are due to electronic state changes, many of the complications found 
in optical spectra are not encountered when considering transitions in the 
cm and mm range. However the electronic state does affect the vibrational 
and rotational levels even in the radio range. For most molecules, the ground 
state has zero electronic angular momentum, that is, a singlet sigma, 
state. For a small number of molecules such as OH, CH, C2H, or C3H, this is 
not the case; these have ground state electronic angular momentum. Because 
of this fact, the rotational energy levels experience an additional energy- level 
splitting, which is A doubling. This is a result of the interaction of the rotation 
and the angular momentum of the electronic state. This splitting causes the 
degenerate energy levels to separate. This splitting can be quite important for 
n states; for A and higher states it is usually negligible. The OH molecule 
is a prominent example for this effect. Semi-classically, the A doubling of 
OH can be viewed as the difference in rotational energy of the (assumed 
rigid) diatomic molecule when the electronic wave function is oriented with 
orbitals in a lower or higher moment of inertia state. We show a sketch of 
this in the upper part of Fig. 1211 Since the energy is directly proportional to 
the total angular momentum quantum number and inversely proportional to 
the moment of inertia, the molecule shown on the left has higher energy than 
the one shown on the right. 

There are also a few molecules for which the orbital angular momentum 
is zero, but the electron spins are parallel, so that the total spin is unity. 
These molecules have triplet sigma '^S ground states. The most important 
astrophysical example is the SO molecule; another species with a triplet S 
ground state is O2. These energy levels are characterized by the quantum 
number, N, and the orbital angular momentum quantum number J. The 
most probable transitions are those within a ladder, with A J — AN = ±1, 
but there can be transitions across N ladders. As with the OH molecule, some 
states of SO are very sensitive to magnetic fields. One could then use the 
Zeeman effect to determine the magnetic field strength. This may be difhcult 
since a 1 /xGauss field will cause a line splitting of only about 1 Hz in linear 
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Fig. 21. The lower energy levels of OH showing /l-doubling. F is the total angular 
momentum, including electron spin, while J is the rotational angular momentum 
due to the nuclear motion. The quantum number F includes hyperfine splitting of 
the energy levels. The parities of the states are also shown under the symbol F. 
The A doubling causes a splitting of the J states. In the sketch of the OH molecule, 
the shaded regions represent the electron orbits in the A state. The two unshaded 
spheres represent the O and H nuclei. The configuration shown on the left has the 
higher energy [38j . 



polarization. Even so, measurements of the polarization of the Jn = Iq — Oi 
line of SO ([35]) may allow additional determinations of interstellar magnetic 
fields. 

12.8.1 Molecules with Hindered Motions 

The most important hindered motion involve quantum mechanical tunneling; 
such motions cannot occur in classical mechanics because of energy consider- 
ations. A prime example of this phenomenon is the motion of the hydrogen 
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atom attached to oxygen in CH3OH, methanol. This H atom can move be- 
tween 3 positions between the three H atoms in the CH3 group. Another 
example is motion of the CH3 group in CH3COOH, methyl formate. These 
are dependent on the energy. At low energy these motions do not occur, while 
at larger energies are more important. For both methanol and methyl for- 
mate these motions allow a large number of transitions in the millimeter and 
sub-millimeter range. 

The description of energy levels of methyl formate follows the standard 
nomenclature. For methanol, however, this is not the case, due to historical 
developments. These energy levels are labelled as Jfc, where K can take on 
both positive and negative values as in Fig. [22l There is a similar scheme for 
naming energy levels of A type methanol, as A^. A and E type methanol are 
analogous to ortho and para species, in that these states are not normally 
coupled by collisions. 



ISO — 




Fig. 22. Energy-level diagram of E type methanol, CH3OH. This is an asymmet- 
ric top. The energy- level structure is typical of an almost prolate symmetric top 
molecule. The lines connecting the levels show the spontaneous transitions with the 
largest A coefficients from eac level; where the two largest A coefficients are within 
a factor of 2, both transitions are shown 1531. 



Torsionally excited states of methanol have been found in the interstellar 
medium. These are analogous to transitions from vibrationally excited states 
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to the ground state of a molecule. Another complexity is that because of its 
structure, there are two dipole moments, along either the c or the a axis 

13 Astronomical Applications 

In the following, methods to deterimine the parameters of molecular clouds 
are summarized 

13.1 Kinetic Temperatures 

A crucial input parameter for the LVG calculations is Tk- Linewidths of 
thermally excited species provide a definite value for Tk if the turbulent 
velocity can be neglected. The relation is 

rk = 21.2(TO/mH) {^Vtf 

where m is the molecule, mH is the mass of hydrogen, and AVt is the FWHP 
thermal width. Such a relation has been applied to NH3 lines (with success!) 
in quiescent dust clouds. 

Historically, Tk was obtained at first from the peak intensities of rota- 
tional transitions of CO, which has a small spontaneous decay rate. From 
the ratio of CO to ^'^CO line intensities, t(CO) ^ 1. After correcting for 
cloud size, from (|172p Tmb and T^x are directly related. In addition, the large 
optical depths reduce the critical density by a factor r, the line optical depth, 
so that for the J = 1 ^ line, the value of the critical density n* « 50 cm""^. 
Then, the level populations are determined by collisions, so the excitation 
temperature for the J = 1 — > transition is Tk. Usually, it is assumed that 
the beam filling factor is unity; for distant clouds or external galaxies, the 
filling factor is clearly less. 

An alternative to CO measurements makes use of the fact that radiative 
transitions between different K ladders in symmetric top molecules are for- 
bidden. Then the populations of the different K ladders are determined by 
collisions. Thus a method of determining Tk is to use the ratio of popula- 
tions in different K ladders of molecules such as NH3 at 1.3 cm, or CH3CCH 
and CHaCNin the mm/sub-mm range. This is also approximately true for 
different Ka ladders of H2CO. Since ratios are involved, beam filling factors 
play no role. Even for extended clouds, Tk values from CO and NH3 may not 
agree. This is because NH3 is more easily dissociated so must arise from the 
cloud interior. Thus for a cloud heated externally, the Tk from CO data will 
be larger than that from NH3. 

For NII3, the rotational transitions, ( J 1, K) (J, K), occur in the far 
infrared and have Einstein A coefficients of order Is^^; for inversion tran- 
sitions, A values are ~ 10"'' s~^ (see, e.g., Table 2). These non metastable 
states require extremely high II2 densities or intense far infrared fields to be 
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populated. Thus NH3 non metastable states (J > K) are not suitable for 
Tk determinations. Rather, one measures the inversion transitions in differ- 
ent metastable (J = K) levels. Populations cannot be transferred from one 
metastable state to another via allowed radiative transitions. This occurs via 
collisions, so the relative populations of metastable levels are directly related 
to Tk- The column densities are obtained from the inversion transitions from 
different metastable states, and convert these to column densities using (|167p 
to (|170p . If the NH3 lines are optically thick, one can use the ratios of satel- 
lite components to main quadrupole hyperfine components, in most cases, to 
determine optical depths. A large number of Tk determinations have been 
made using NH3 in dark dust clouds. Usually these involve the (J, K)={1,1) 
and (2,2) inversion transitions. 

13.2 Linewidths, Radial Motions and Intensity Distributions 

From the spectra themselves, the linewidths, Z\Vi/2, and radial velocities, 
Visr, give an estimate of motions in the clouds. The AY 1/2 values are a 
combination of thermal and turbulent motions. Observations show that the 
widths are supersonic in most cases. In cold dense cores, motions barely 
exceed Doppler thermal values. Detailed measurements of lines with moderate 
to large optical depths show that the shapes are nearly Gaussian. However, 
simple models in which unsaturated line shapes are Gaussians would give flat 
topped shapes at high optical depths. This is not found. More realistic models 
of clouds are those in which shapes are determined by the relative motion 
of a large number of small condensations, or clumps, which emit optically 
thick line radiation. If the motions of such small clumps are balanced by 
gravity, one can apply the virial theorem. Images of isolated sources can be 
used for comparing with models. One example is the attempt to characterize 
the kinetic temperature and the H2 density distributions from spectral line 
or thermal dust emission data. 

13.3 Determinations of H2 Densities 

If a level is populated by collisions with H2, the main collision partner, a 
first approximation to the H2 density can be had if one sets the collision 
rate, n(H2)(crw) equal to the A coefficient. The brackets indicate an average 
over velocities of H2, which are assumed to be Boltzmann distributed. The 
H2 density for a given transition which will bring Tex midway between the 
radiation temperature and Tk is referred to as the critical density, and is 
denoted by n* . That is n*{av) — A. However, this can be at best only an 
approximate estimate. For reliable determination of H2 densities n(H2) one 
must measure at least two spectral lines of a given species, estimates of the 
kinetic temperature, collision rates, and a radiative transport model. One 
can assume that the lines are optically thin, and use a statistical equilibrium 
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model, but the present approach is to apply the LVG model [35] ■ Clearly, the 
more lines measured, the more reliable the result. For hnear molecules such 
as CO or CS, it is not possible to separate kinetic temperature and density 
effects. For example, CO with Tk=10K will have a 3=2-1 line much weaker 
than the J=l-0 line no matter how high the density. However, if the plot of 
normalized CO column density versus energy above the ground state shows 
a turn over, that is a decrease in intensity, it is possible to find a unique 
combination of Tk and n(H2). 

13.4 Cloud Masses 
13.4.1 Virial Masses 

If we assume that only gravity is to be balanced by the motions in a cloud, 
then, for a uniform density cloud of radius i?, in terms of the line of sight 
FWHP velocity, virial equilibrium requires: 



Once again very optically thick lines should not be used in determining masses 
using pin)) . 

Another probe of such regions is thermal emission from dust (jll7p . Given 
the dust temperature, such data allow one to derive H2 abundances and cloud 
masses. 

13.4.2 Masses from Measurements of CO and ^^CO 

CO is by far the most widespread molecule with easily measured transitions. 
The excitation of CO is close to LTE and the chemistry is thought to be 
"well understood", so measurements of CO and CO isotopomers (and dust 
emission measurments!) are the most important tool(s) for estimating masses 
of molecular clouds. 

Even if all of the concepts presented in this section are valid, there may 
be uncertainties in the calculation of the column densities of CO. These 
arise from several sources which can be grouped under the general heading 
non-LTE effects. Perhaps most important is the uncertainty in the excitation 
temperature. While the ^■^CO emission might be thermalized even at densities 
< 100 cm^^, the less abundant isotopes may be sub-thermally excited, i.e., 
populations characterized by Tex < Tk. (this can be explained using the LVG 
approximation (|137p V Alternatively, if the cloud in question has no small scale 
structure, ^'^CO emission will arise primarily from the cloud interior, which 
may be either hotter or cooler than the surface; the optically thick ^^CO 
emission may only reflect conditions in the cloud surface. Another effect is 




(200) 



Millimeter / Sub-Millimeter Astronomy 



103 



that, although Tgx may describe the population of the J = and J = 1 states 
well, it may not for J > 1. That is, the higher rotational levels might not 
be thermalized because their larger Einstein A coefRcients lead to a faster 
depopulation. This lack of information about the population of the upper 
states leads to an uncertainty in the partition function. Measurements of 
other transitions and use of LVG models allow better accuracy. For most 
cloud models, LTE gives overestimates of the true ^^CO column densities 
by factors from 1 to 4 depending on the properties of the model and of the 
position in the cloud. Thus a factor of ~ two uncertainties should be expected 
when using LTE models. The final step to obtain the H2 column density is 
to assume a ratio of CO to H2. This is generally taken to be 10"'*. In spite 
of all these uncertainties, one most often attempts to relate measurements of 
the CO column density to that of H2; estimates made using lines of CO (and 
isotopomers) are probably the best method to obtain the H2 column density 
and mass of molecular clouds. 

An LVG treatment of the dependence of the total column density on the 
line intensity of the J = 2 ^ 1 line shows that a simple relation is valid for 
Tk from 15 K to 80 K, and n{B.2) from ~ 10^ to ~ lO'^cm"^. An assumption 
used in obtaining this relation is that the ratio of C^^O to H2 is 1.7 x 10"'^, 
which corresponds to (C/H2) = 10"*, and (^^Of^^O) = 500. The latter ratio 
is obtained from isotopic studies for molecular clouds near the Sun. Then we 
have 



The units of v are kms~^, of Tmb(C*^0, J = 2 — > 1) are Kelvin, and of Nn^ 
are cm~^. This result can be used to determine cloud masses, if the distance 
to the cloud is known, by a summation over the cloud, position by position, 
to obtain the total number of H2 . 

The total cloud masses obtained from the methods above, or similar meth- 
ods, is sometimes referred to as the 'CO mass'; this terminology can be mis- 
leading, but is frequently found in the literature. 

13.4.3 Masses from the X Factor 

In large scale surveys of the CO J = 1 ^ line in our galaxy and external 
galaxies, it has been found, on the basis of a comparison of CO with *^C0 
maps, that the CO integrated line intensities measure mass, even though this 
line is optically thick. The line shapes and intensity ratios along different 
lines of sight are remarkably similar for both *^C0 and ^"^CO line radiation. 
This can be explained if the total emission depends primarily on the number 
of clouds. If so, ^^CO line measurements can be used to obtain estimates 
of Nqq. Observationally, in the disk of our galaxy, the ratio of these two 
quantities varies remarkably little for different regions of the sky. 

This empirical approach has been followed up by a theoretical analysis. 
The basic assumption is that the clouds are virial objects, with self-gravity 
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balancing the motions. If these clouds are thought to consist of a large number 
of clumps, each with the same temperature, but sub thermally excited (i. e., 
Tex < 2k), then from an LVG analysis of the CO excitation, the peak intensity 
of the CO line will increase with ■\/n(H2), and the linewidth will also increase 
by the same factor, as can be seen from (|200p . The exact relation between 
the integrated intensity of the CO J = 1 ^ line and the column density 
of H2 must be determined empirically. Such a relation has also been applied 
to other galaxies and the center of our galaxy. However, the environment, 
such as the ISRF, may be very different and this may have a large effect on 
the cloud properties. For the disk of our galaxy, a frequently used conversion 
factor is: 

Nu,=X J TmbICO, J = 1 ^ 0) dw 

= 2.3 X 102°y"TMB(CO, J = 1 ^ 0)du. (202) 

where X — 2.3 x 10^" and Nn^ is m units of cm^^. By summing the intensities 
over the cloud, the mass in Mq is obtained. Strictly speaking, this relation is 
only valid for whole clouds. The exact value of the conversion factor between 
CO integrated line intensity and mass, X, is a matter of some dispute. 

13.5 Additional Topics 

13.5.1 Freezing Out on Grain Surfaces 

For H2 densities > lO^cm"^, one might expect a freezing out of molecules 
onto grains for cold, dense regions. From a simplified theory for H2 densities > 
10^ cm~^, the time for a molecule-grain collision is 3000 years, short compared 
to other time scales. Then CO and most other molecules might be condensed 
out of the gas phase, so that spectroscopic measurements cannot be used as 
a probe of very dense regions. Empirically N2H+, NH3 and H2D+ appear to 
remain in the gas phase even at low kinetic temperatures and high densities. 

13.5.2 Self Shielding of CO 

As is well established, CO is dissociated by line radiation. Since the optical 
depth of CO is large, this isotopomer will be self-shielded. If there is no fine 
spatial structure, selective dissociation will cause the extent of ^^CO to be 
greater than that of "'^'^CO, which will be greater than the extent of C^'^^O. 
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